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Abstract. The areas of Ramsey Theory and Random Graphs have been closely linked 
every since Erdos' famous proof in 1947 that the 'diagonal' Ramsey numbers R(k) grow 
exponentially in k. In the early 1990s, the triangle-free process was introduced as a model 
which might potentially provide good lower bounds for the 'off-diagonal' Ramsey numbers 
i?(3, k). In this model, edges of K n are introduced one-by-one at random and added to the 
C*\ graph if they do not create a triangle; the resulting final (random) graph is denoted G„.a- 

O In 2009, Bohman succeeded in following this process for a positive fraction of its duration, 

and thus obtained a second proof of Kim's celebrated result that R(3,k) = 0(fc 2 /logfc). 

In this paper we improve the results of both Bohman and Kim, and follow the triangle-free 
process all the way to the end. In particular, we shall prove that 
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e(G nA ) = (^+ (l))n 3 /Vlogn, 



O 

u 

with high probability as n — > oo. We also obtain several 'pseudo-random' properties of G b ,a , 
(H and use them to bound its independence number, which gives as an immediate corollary 



^4 / log A: 

This significantly improves Kim's lower bound, and is within a factor of 4 + o(l) of the best 
known upper bound, proved by Shearer over 25 years ago. 

> 

On 

\q l. Introduction 



For more than eighty years, since the seminal papers of Ramsey [44] and Erdos and 
Szekeres [32], the area now known as Ramsey Theory has been of central importance in 
Combinatorics. The subject may be summarised by the following mantra: "Complete chaos 
is impossible!" or, more precisely (if less poetically), "Every large system contains a well- 
ordered sub-system." The theory consists of a large number of deep and beautiful results, as 
well as some of the most important and intriguing open questions in Combinatorics. These 
open problems have, over the decades, been a key catalyst in the development of several 
powerful techniques, most notably the Probabilistic Method, see [6]. 

The archetypal Ramsey-type problem is that of bounding Ramsey numbers. The basic 
question is as follows: for which n G N does it hold that every red-blue colouring of the edges 
of the complete graph K n contains either a red or a blue K{! The Ramsey number, 
denoted R(k,£), is defined to be the smallest such integer n. Shortly after Ramsey [44] 
proved that R(k, £) is finite for every k and £, Erdos and Szekeres [32] proved in 1935 the 
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explicit upper bound 

which implies in particular that R(k) := R(k,k) ^ 4 fc , and that R(3,k) = 0(k 2 ). Despite 
an enormous amount of effort, the former bound has only recently been improved by a 
super-polynomial factor, by Conlon [24], who refined the earlier method of Thomason [52]. 
A constructive super-polynomial lower bound was not obtained until 1981, by Frankl and 
Wilson [34], whose beautiful proof used techniques from linear algebra; however, an exponen- 
tial lower bound was given already in 1947 by Erdos [26], whose idea ("Colour randomly!") 
has probably been the most influential two-word proof in the development of Combinatorics. 
Over the past 65 years this bound has only been improved by a factor of two, by Spencer [49] 
in 1975, using the Lovasz Local Lemma [28]. In summary, the current state of knowledge in 
the 'diagonal' case is as follows: 

-„(!))*. 2*^ <fl(*)<«p(- n (l)gg) -4» 

After the diagonal case, the next most extensively studied setting is the so-called 'off- 
diagonal' case, £ = 3, where much more is known. As noted above, it follows from the Erdos- 
Szekeres bound (1) that R(3,k) = 0{k 2 ) and, in a visionary paper from 1961, Erdos [27] 
proved a lower bound of order /c 2 /(log/c) 2 by applying a deterministic algorithm to the 
random graph G(n,p). An important breakthrough was obtained by Ajtai, Komlos and 
Szemeredi [3, 4] in 1980, who proved that R(3,k) = 0(k 2 / log k), and a little later by 
Shearer [48], who refined the method of [3] and obtained a much better constant. However, 
it was not until 1995 that a complimentary lower bound was obtained, in a famous paper 
of Kim [38]. We remark that the papers [4] and [38] were particular important, since they 
(respectively) introduced and greatly developed the so-called semi-random method. 

More recently, Bohman [10] gave a new proof of Kim's result, using the triangle- free process 
(see below). In this paper, we shall significantly improve both of these results, following the 
triangle-free process to its (asymptotic) end, and proving a lower bound on R(3, k) that is 
within a factor of 4 + o(l) of Shearer's bound. We remark that very similar results have 
recently been obtained independently by Bohman and Keevash [14] using related methods. 

1.1. Random graph processes. The modern theory of random graph processes was initi- 
ated by Erdos and Renyi [29, 30] in 1959, who studied the evolution of the graph with edge 
set {ei, . . . , e m }, where (ei, . . . , e^) is a (uniformly chosen) random permutation of E(K n ). 
This model, now known as the Erdos-Renyi random graph, is one of the most extensively 
studied objects in Combinatorics, see [20] or [37]. Two especially well-studied problems are 
the emergence of the 'giant component' around m = n/2, see [17, 30, 36] (or, more recently, 
e.g., [25]), and the concentration (and location) of the chromatic number [1, 18, 19, 40, 47], 
where martingale techniques, which will play a important role in this work, were first used 
to study random graph processes. 

In general, a random graph process consists of a randomly chosen sequence of edges e e 
E(K n ) N , and a (random or deterministic) rule which determines the graph G m as a function 
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of the sequence (ei,...,e m ) (and often just of the graph G m -\ and the edge e m ). The 
study of these objects has exploded in recent years, as the ubiquity of random-like networks 
in nature has come to the attention of the scientific community. Particularly well-studied 
processes include the 'preferential attachment' models of Barabasi and Albert [7], and the so- 
called 'Achlioptas processes' introduced by Achlioptas in 2000 (see, e.g., [2, 9, 11, 15, 50]), 
and studied most notably by Riordan and Warnke [45, 46]. We mention also the very 
general class of 'inhomogeneous random graphs' introduced recently by Bollobas, Janson 
and Riordan [22]. 

A technique which has proved extremely useful in the study of random graph processes 
is the so-called 'differential equations method', which was introduced by Wormald [55, 56] 
in the late 1990s. In this method, the idea is to 'track' a collection of graph parameters, 
by showing that (with high probability) they closely follow the solution of a corresponding 
family of differential equations. This method has been used with great success in recent 
years, see for example [8, 10, 12, 13]; we note in particular the recent result of Bohman, 
Frieze and Lubetzky [12] that the 'triangle removal process' ends with n 3 ^ 2+ °^ edges. 

1.2. The triangle-free process. Consider the following random graph process (G m ) rne n 
on vertex set [n] = {1, . . . , n}. Let Go be the empty graph and, for each m G N, let G m 
be obtained from G m -\ by adding a single edge, chosen uniformly from those non-edges of 
G m _i which do not create a triangle. The process ends when we reach a maximal triangle- free 
graph; we denote by G n ,A this (random) final graph. 

The triangle-free process was first suggested by Bollobas and Erdos [21] at the "Quo 
Vadis, Graph Theory?" conference in 1990 as a possible method of obtaining random graphs 
with good Ramsey-type properties, and was first studied rigorously by Erdos, Suen and 
Winkler [31], who showed that, with high probability 1 , G n .A has at least crc'l 2 edges for 
some constant c > 0. Determining the order of magnitude of e{G n ^) remained an open 
problem for nearly 20 years until the breakthrough paper of Bohman [10], who followed the 
triangle-free process for a constant proportion of its lifespan, and hence proved that 

e(G nA ) = e(n 3 / 2 v^). (2) 

Shortly afterwards, Bohman and Keevash [13] extended and generalized the method of [10] 
to the setting of the H-free process, where the triangle is replaced by an arbitrary 'forbid- 
den' graph H. Improving on earlier results of Bollobas and Riordan [23] and Osthus and 
Taraz [42], they proved that for any strictly balanced 2 graph H, the number of edges in the 
final graph G n> n satisfies 

e(G n , H ) > c-n 2 - l l m ^ H \\ognf^ H ^ 

for some constant c > which depends on H, which is conjectured to be within a constant 
factor of the truth. (See [43, 53, 54] for upper bounds in the cases H = K 4 and H = Cg.) 

throughout the paper, we shall write 'with high probability' to mean with probability tending to 1 as 
n — > oo, where n = |V(G n) A)| is the size of the vertex set in the triangle-free process. In practice, the 
probabilities of our bad events will all be at most n~ logn , see for example Theorem 6.9. 

2 m 2 (i?) := max l^plZ^ i an d a graph is said to be strictly balanced if < ^pp^ for every F C H. 
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Although in this paper we shall focus on the case H = K 3 , we believe that our methods can 
also be applied in the general setting, and we plan to return to this topic in a future work. 

We shall follow the triangle- free process until o(n 3//2 ^log nj steps from the end, and hence 
obtain a sharp version of Bohman's Theorem. Our first main result is as follows. 

Theorem 1.1. 

e(Gn A ) = (^ + o(l))n 3 / 2 v/loI^ 
with high probability as n — >• oo. 

We shall moreover control various parameters associated with the graph process, showing 
that they take the values one would expect in a random graph of the same density. Thus, 
one may morally consider the main result of this paper to be the following imprecise state- 
ment: "G„,a is well-approximated by a random graph with no triangles." One significant 
consequence of this pseudo-theorem (or rather, of the precise theorems stated below), is that 
we can bound (with high probability) the independence number of G n> A, and hence obtain 
the following improvement of Kim's lower bound on R(3, k). 

Theorem 1.2. 

as k — > oo. 

We repeat, for emphasis, that the upper bound in Theorem 1.2 was proved by Shearer [48] 
over 25 years ago. We remark that the factor of 4 + o(l) difference between the two bounds 
appears to have two separate sources, each of which contributes a factor of two. To see this, 
recall that Shearer proved that 

a(G) > (l-o(l))^ (3) 

for any triangle-free n-vertex graph with average degree d, which implies the bound stated 
above, since the independence number of such a graph is clearly also at least d. Our results 
(see Theorem 2.12, below) imply that (3) is within a factor of two of being best possible 
(in the critical range), and we suspect that G n ,A is asymptotically extremal. Moreover, the 
independence number of G„ 5 a is (perhaps surprisingly) roughly twice its maximum degree, 
rather than asymptotically equal to it. Motivated by the heuristic that 'random is best' for 
many Ramsey-type problems, we conjecture that our bound is in fact sharp. 

Conjecture 1.3. 



R(3,k) = 




as k oo. 

We remark that, as in the diagonal case, the best known constructive lower bound for 
R(3, k) is quite far from the truth. To be precise, Alon [5] constructed a triangle-free graph 
with 0(/c 3//2 ) vertices and no independent set of size k. 
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We finish this section by giving a rough outline of the proof of Theorems 1.1 and 1.2; a 
more extensive sketch (together with several of our other main results) is given in Section 2. 
Our plan (speaking very generally) is to use the so-called 'differential equations method' 
whose application in Combinatorics was pioneered by Wormald [55, 56]. However, we can 
assure the reader who is intimidated by this technique that no actual differential equations 
will be needed in the proof. Moreover, apart from the use of a martingale concentration 
inequality due to McDiarmid [41], our proof will be completely self-contained. 

The basic idea is to 'track' a large (but finite) collection of graph parameters, such that 
the (expected) rate of change of each depends only on some (generally small) subset of the 
others. We shall show that, for each of these parameters, the probability that it is the first 
parameter to go astray (that is, to have normalized error larger than 1) is extremely small; 
the lower bound in Theorem 1.1 then follows by applying the union bound to our family 
of events. To prove Theorem 1.2 and the upper bound in Theorem 1.1, we assume that 
these parameters were all tracking up to step m* (the point at which we lose control of the 
process), and show (by bounding the probability of some carefully chosen events) that 

a(G nA ) ^ a(G m *) < (y/2 + o(l)) y/nlogn 

with high probability, and that the maximum degree of G m is unlikely to increase by more 
than o(\/n log n) between step m* and the end of the process. 

The most basic parameter we shall need to track is the number of open edges in G m , where 

0(G m ) = {e G E{K n ) \ E{G m ) : e % N G Jv) for every v G V(G m )}. (4) 

We shall write Q{m) = \0{G m )\. Observe that the open edges of G m are exactly those which 
can be added to the graph at step m+1. In order to control Q(m), we shall need to track the 
parameters X e (m) and Y e (m) (see Figure 2.1) for each e G 0(G m ); these variables control 
the number of edges which are closed at each step of the process, and (together with Q(m)) 
they also control their respective derivatives. The collection 

Q(m) U (X e (m) : e G 0(G m )} U {Y e (m) : e G 0(G m )} 

thus forms a closed system, and for this reason Bohman [10] was able to track each of these 
variables up to a small (but rapidly growing) absolute error. 

In order to prove Theorem 1.1, we shall need to control these parameters up to a much 
smaller absolute error; in fact the error term we need decreases super- exponentially quickly 
in t — m • n~ 3 / 2 . In order to obtain such a tiny error, we shall exploit the self-correcting 
nature of the triangle-free process 3 ; doing so requires three separate steps, which are all new 
and quite different from one another, and each of which relies crucially on the other two. 

First, we show that Q(m) evolves (randomly) with X(m) and Y{m) (the averages over all 
open edges of G m of the variables X e {m) and Y e (m), respectively) according to a 'whirlpool- 
like' structure (see Section 6). Using a suitably chosen Lyapunov function, we are able to 
show that this three-dimensional system is self-correcting, even though Q(m) itself is not. 

Second, for each k G N and e G 0(G m ), we track a variable v} k \m), which is (roughly 
speaking) the k th derivative of Y e {m). To define this variable, consider for each m G N 



'For previous applications of self-correction in other settings, see [12, 16, 51]. 
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the graph (the 'F-graph' of G m ) with vertex set 0(G m ), and an edge between each pair 
{/,/'} such that /' G Yf(m); then Ve k \m) is the weighted average 4 of Yf(m) over the edges 
/ G 0(G m ) at walk-distance k from e in the F-graph. Crucially, our error bounds on these 
variables decrease exponentially quickly in k, and using this fact we shall be able to prove 
self-correction. A vital ingredient in this calculation amounts to showing that a random 
walk on the F-graph mixes in constant time (see Sections 5.1 and 5.2), and the proof of this 
property of the F-graph uses the fact that we can track certain 'ladder-like' graph structures 
in G m . We consider the technique introduced in this part of the proof, which is performed 
in Section 5, to be the most important innovation of this paper. 

Finally, in order to control the number of 'ladder-like' structures, we shall in fact track 
the number of copies of every graph structure F which occurs in G m (at a given 'root'), up 
to the point at which it is likely to disappear, and after this time we shall bound the number 
of copies up to a polylog-factor. Such a general result is not only interesting in its own right; 
it is necessary for our proof to work, because (for our martingale bounds) we need to track 
the maximum possible number of copies of each structure which are created or destroyed in 
a single step of the triangle-free process, which depends on (the number of copies of) several 
other structures, some of which may be tracking, and others not. This part of the proof is 
extremely technical, and making it work requires several non-trivial new ideas; we mention 
in particular the 'building sequences' introduced in Section 4.1. 

The rest of the paper is organised as follows. In Section 2 we give a detailed outline of the 
proofs of Theorems 1.1 and 1.2, and state various further results about the graph structures 
G m . In Section 3 we introduce our martingale method, which is different from that used 
in [10], and use it to track the variables X e , which form a particularly simple special case. 
In Sections 4-6 we prove the lower bound in Theorem 1.1; more precisely, in Section 4 we 
study general graph structures, in Section 5 we control the variables Y e (m) and vj k \m), 
and in Section 6 we track X(m), Y{m) and Q(m). Finally, in Section 7, we study the 
independent sets in G n> A, and deduce the upper bound in Theorem 1.1 and the lower bound 
in Theorem 1.2. To avoid distracting the reader from the key ideas with too much clutter, 
we postpone a few of the more tedious calculations to an Appendix [33]. 

2. An overview of the proof 

In this section we shall lay the foundations necessary for the proofs of Theorems 1.1 and 1.2. 
In particular, we shall formally introduce the various families of variables which we will need 
to track, and state some of our key results about these variables. We shall attempt to give the 
reader a bird's-eye view of how the various components of the proofs of our main theorems 
fit together, whilst simultaneously introducing various notations and conventions which we 
shall use throughout the paper, often without further comment. We strongly encourage the 
reader, before plunging into the details of Sections 4-7, to read carefully both this section, 
and Section 3.4. In what follows, the main heuristic to keep in mind at all times is that G m 
should closely resemble the Erdos-Renyi random graph G(n,m), except in the fact that it 
contains no triangles. 



The weight of an edge / in V} (m) is equal to the number of walks of length k from e to /. 
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We begin by choosing some parameters which will be fixed throughout the paper. Let e > 
be an arbitrary, sufficiently small constant. Given e, we choose a sufficiently large constant 
C = C(e) > 0, and a function oo = oj(n) which goes to infinity sufficiently slowly 5 as n — > oo. 
We shall assume throughout the paper that n ^ n (e,C,uj) is sufficiently large. 

For each m G N, let the time t after m steps of the triangle-free process be defined by 
m — t- n 3 / 2 . (We shall use this convention throughout the paper without further comment.) 
Thus, setting 

t* := (^j= - £ ) V^gn and m * : = t* ■ ™ 3/2 , ( 5 ) 

our aim is to follow the triangle-free process up to time t*. 

Recall that Q(m) = \0(G m )\ is the random variable which denotes the number of open 
edges of G m , see (4). The triangle-free process ends when Q(m) = 0, since only open edges of 
G m may be added in step m + 1. The lower bound in Theorem 1.1 is therefore an immediate 
corollary 6 of the following theorem. 

Theorem 2.1. With high probability, 

-At>(A _ 2t 2 7/4 



Q(m) G e~ 4t (JJ ±e- a n"\\ognY (6) 

for every m ^ — e)n?l 2 y/\ogn. 

To see why Q(m) should decay at rate e -4 ' 2 , simply recall that G m resembles a random 
graph of density p ~ 2m/n 2 ; it follows that the proportion of open edges in G m should be 
roughly (1 — p 2 ) n ps e~ p n e~ 4t . Note that the absolute error in our bound (6) actually 
decreases over time; this is possible because we exploit the self-correcting nature of the 
process. We shall denote the (maximum allowed) relative error by 

9q (t) = e^V^Qogn) 3 , 

and say that Q is tracking up to step w! if Q(m) e (l ± g q (t))e~ 4t2 (™) holds for all m ^ w! . 
Note that, by (5), we have g g (t) ^ n~ e for every t ^t*. 

For each of our graph parameters, the first step will always be to determine the expected 
rate of change of that parameter, conditioned on the past. Indeed, for any graph parameter 
/ random variable A(m), we shall write AA(m) = A(m + 1) — A(m) and 

E [AA(m)] = E [A{m + 1) - A(m) \ G m ] . 

We emphasize that our process is Markovian, and thus all of the relevant information about 
the past is encoded in the graph G m . 

The value of E[AQ(m)] is controlled by the following family of parameters, which were 
already introduced informally above. The variables \Y e {m) : e G 0(G m )} determine (the 
distribution of) the number of edges which are closed at each step. 



5 In particular, we could set u>(n) — log log log n. 

6 We write a G b ± c to mean b~c^a^b + c. Note that if t < t* then e 2 * < n 1 / 4 ^ 6 , and hence it 
follows from (6) that Q(m) > for every m < m*. The lower bound in Theorem 1.1 follows since e > was 
arbitrary. 
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Definition 2.2. We say that two open edges e, / G 0(G m ) are Y -neighbours in G m if e and 
/ form two sides of a triangle, the third of which is in E(G m ). For each edge e G E(K n ) and 
each m G N, define 

F e (m) := j/ G 0(G m ) : / is a F-neighbour of e in G m || 

if e G 0(G m )i and set F e (m) = "K e (m — 1) otherwise. 

We remark that, in order to ease our notation, we shall sometimes write Q(m) and Y e (m) 
for (respectively) the collection of open edges and the collection of F-neighbours of e in G rn . 
(In fact, we shall do the same for all of the various variables we define below.) We trust that 
this slight abuse of notation will not cause any confusion. 




Figure 2.1. The sets X e and Y e . 

Once again recalling that G m is well- approximated by G(n,m), it follows that we would 
expect Y e (m) to be asymptotically equal to 2pn ■ e~ 4 ' ~ 4te~ 4 * \fn for every open edge 
e G 0(G m ), and every m ^ m*. Showing that this is true turns out to be the most difficult 
part of our proof, and will be the key intermediate result in the proof of Theorem 1.1. To 
begin, note that / G Y e {m) if and only if e G Yf(m), and define 

^ y ' eeQ(m) 

the average number of Y- neighbours of an open edge in G m . The following simple equation 
governs the expected change in Q(m) at each step: 

E[AQ(m)] = -F(m) - 1. 

Thus, in order to control Q(m), we shall need to track Y(m), which is controlled (see 
Section 6) by the following equation 7 : 

-F(m) 2 + X(m) - 2 ■ Var(y e (m)) ± Q(F(m)) 
E[A7(m)] G ^ . (7) 



7 More precisely, we shall show that these bounds hold for every uj ■ n 3 / 2 < m < m* with high probability, 
as long as the other variables are still tracking. 
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Here the variance is over the choice of a (uniformly) random open edge e G 0(G m ). The 
term X(m) which appears in (7) is defined, similarly as for the Y e , to be the average of the 
following collection of random variables: 

Definition 2.3. We say that two open edges e, / G 0(G m ) are X -neighbours in G m if they 
form two sides of a triangle, the third of which is also in 0(G m ). For each edge e G E(K n ) 
and each m G N, define 

X e (m) := || / G E(K n ) : f is an X-neighbour of e in G m j | 

if e G 0(G m ), and set X e (m) = X e (m — 1) otherwise. 

Again appealing to our random graph intuition, we would expect (and shall prove) that 
X e (m) ps 2e~ 8 * 2 n for every open edge e G 0(G m ). We define 

^ V 7 eeQ(m) 

the average number of X- neighbours of an open edge in G rn . The random variable X{m) is, 
with high probability, controlled by the following equation (see Section 6): 

r _ . - 2 ■ X(m)Y(m) - 3 ■ Cov(A' e (m), Y t (m)) f X(m) + F(m) 2 ' 

E I AX(m) ] e ew * { — 

Here, as in (7), the covariance is over the choice of a (uniformly) random open edge e G 
0(Gr m ). Combining these equations, we shall be able to show that the normalized errors of 
Y and Q form a stable two-dimensional system, and that X is self-correcting (see Section 6.1). 
Hence, by applying our martingale method (see Section 3) to a suitably-chosen Lyapunov 
function, we shall be able to prove both Theorem 2.1 and the following bounds. 

Theorem 2.4. With high probability, 

IN G (l±^fl^).2e-- 2 n and Y(m) G (l ± • Ate^ 

for every to ■ n 3 / 2 < m ^ — £)n 3//2 \/logn. 

Note also that once again, although the relative error is increasing with t, the absolute 
error is decreasing super-exponentially quickly. We shall write 

Q{m) = e ~ At2 Q , X{m) = 2e" 8 * 2 n and Y(m) = 4te" 4 * 2 ^ 

to denote the paths that we expect Q(m), X(m) and Y(m) to follow. 

The alert reader will have noticed that in order to prove Theorems 2.1 and 2.4, we are 
going to need some bounds on Var(F) and Cov(X,Y). In fact, proving such bounds turns 
out to be the main obstacle in the proofs of Theorems 1.1 and 1.2; more precisely, our main 
problem will be controlling the variables Y e (m), which are the key to the process. Our key 
intermediate result with therefore be the following bounds on Y e {m). 
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Theorem 2.5. With high probability, 



Y e (m) G Ate'^y/n ± n 1/4 (logn) 3 



for every open edge e G 0(G m ) and every m ^ to ■ n 3//2 ; and 




(8) 



for every open edge e G 0(G m ) and every to ■ n 3 / 2 < m ^ [w% ~~ £ ) 



) n 3 / 2 A/log n. 



Let us denote the relative error in (8) by 



g y (t) = e 2t VV>gn)4, 



and observe that g y (t) ^> ^(t) and that g y (t) 2 Y (m) 1 (up to a polylog factor). It seems 
plausible that the distribution of the Y e (m) might be 'well-approximated' (in some sense) by 
a collection of independent Gaussians, each centred at Y(m); however, our current techniques 
seem quite far from being able to prove such a strong statement. 

The proof of Theorem 2.5 is somewhat intricate; we shall sketch here just the basic ideas. 
Observe first that the variable Y e (m) is governed by the equation 8 



We remark that the main term (when t is large) is the first one, since Y(m) 2 ^> X(m). 
Note that the rate of change of Y e (m) depends on itself, but also on the Y- values of its 
Y~-neighbours. Thus, although it appears that Y e (m) should be self-correcting (since larger 
values of Y e (m) produce a more negative first derivative), this effect can be outweighed for 
a specific edge e if the Y- values of the edges in Y e (m) are unusually large or small. 

For this reason, it is necessary to introduce a variable (for each open edge e G 0(G m )) 
which counts the number of walks of length two in the F-graph 9 . However, in order to 
control this variable we shall need to track the number of walks of length three, and so on. 
This leads naturally to the following definition. 

Definition 2.6. For each k G N , and each open edge e G 0(G m ), let Ue k \m) denote the 
number of walks in the F-graph of length k, starting from e, and set 



the average of the Y- values reached via such walks. 



8 In fact this is not quite correct: each variable Yf(m) should be replaced by Yf(m) — 1, since if the edge e 
is chosen in step m + 1 of the triangle-free process then AY e (m) = 0, see Definition 2.2. This tiny difference 
will be irrelevant in the calculations below, and so, in the interest of simplicity, we shall ignore such rounding 
errors, as is often done in the case of ceiling and floor symbols. 

9 Recall that this is the graph whose vertices are the open edges in G m , and whose edges are pairs of 
y-neighbours. 
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Note that Y e (m) = V e (m), and that therefore the second statement in Theorem 2.5 is 
the case k = of the following theorem. Fix k (e) = [3/e]. 

Theorem 2.7. For each ^ k ^ fco(e), wii/i probability we have 

/or every open edge e G 0(G m ) and every cu ■ n 3 / 2 < m ^ — e)n 3 / 2 \/logn. 

In fact, in order to prove Theorem 2.7, we shall need to define and control (for each 
k) a more complicated family of variables, \V° : a G {L,R} k }, which takes into account 
whether each step of a walk in the V-graph was taken with the 'left' or 'right' foot; that is, 
which endpoint changed in the step from e G 0(G m ) to / G Y e (m), and which endpoint was 
common to e and /. This is important because a key step in our proof will be to show that, 
for k sufficiently large, after k 2 (random) steps in the F-graph we are (in a certain sense) 
'well-mixed'. More precisely, we shall show that if we have 'changed feet' at least k times 
on this walk then the value of Yf(m) we reach may be well-approximated by Y(m). On 
the other hand, if at some point we took k consecutive steps with the same foot, with the 
other foot fixed at vertex u, then after these k steps we will have (approximately) reached a 
uniformly-random open neighbour of u. We refer the reader to Section 5 for the details. 

In order to prove our mixing results on the F-graph, we shall need good bounds on the 
number of copies of certain structures in G m , which correspond to paths in the F-graph with 
either many or no changes of foot. In order to obtain such bounds, we shall in fact need 
to bound the number of copies of every structure which can occur in G m . We make the 
following definition. 

Definition 2.8. A graph structure F consists of a set of (labelled) vertices V(F), edges 
E(F) C ( n 2 F) ) and open edges 0(F) C ( V ^), where E(F) and 0(F) are disjoint. 

Such a structure is said to be permissible if every triangle in E(F) U 0(F) contains at 
least two edges of 0(F). 

We shall be interested in the number of copies of a graph structure F 'rooted' at a certain 
set of vertices. The following definition allows us to restrict our attention to those embeddings 
for which there (potentially) exists at least one copy of F. 

Definition 2.9. Given a graph structure F and an independent 10 set A C V(F), we say 
that an injective map 0: A — > V(G m ) is faithful at time t if the graph structure on V(F) 
with open edge set 0(F) and edge set 

E(F)UE(G m [ ( j ) (A)]) 

is permissible. 11 

10 In fact, we extend all of the next few definitions to the case where A is not an independent set in F by 
removing the edges and open edges from F[A]. Hence, writing F A for the structure thus obtained, we have 
N A (F) := N A (F A ), t A (F) := t A (F A ), and so on. 

n Note that triangles with at most one open edge do not occur in E(G m ) U 0(G m ). Hence, if is not 
faithful at time t, then N^(F)(m') = for every m! ^ m. 
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Let us refer to a triple (F,A,<f>) such that F is a permissible graph structure, A C V(F) 
is an independent set, and 0: A — > V{G m ) is an injective map, as a graph structure triple. 12 
Given such a triple, we define 

A^(F)(m) := | j^: — >■ V(G m ) : -0 is an injective homomorphism and iJj\a = </>| , 

the number of (labelled) copies 13 of F in G rn which agree with on A, if is faithful, and set 
N ( / ) (F)(m) = N ( f ) (F)(m — 1) otherwise. We remark that, for the sake of brevity of notation, 
we shall often suppress the dependence of N^F) on m. 




Figure 2.2. Two graph structure pairs. 



Let us first estimate the expected size of N^F), i.e., the expected number of copies of F 
rooted at 4>(A), if is faithful. Recalling our guiding principle that G m resembles the Erdos- 
Renyi random graph G(n,m) (except in containment of triangles), and that the (expected) 
densities of edges and open edges in G m are 2t/^/n and e -4 * 2 respectively, it follows that we 
might expect G m to contain roughly 




copies of F rooted at 0(A), where v A (F) = \V(F)\ - \A\, e(F) = \E(F)\ and o(F) = \0(F)\. 

Our next theorem shows that N^F) closely tracks the function Na(F) for every graph 
structure triple (F,A,(f>), as long as is faithful, up to a certain time tA{F) G [0,f]. 
Before we define tA(F), observe that if F has an induced 1 sub-structure A C H C F with 
Na(H) < 1, then we are unlikely to be able to track the number of copies of F, since it 
might be zero, but if N^H) happens to be non-zero then N^F) could be very large. This 
simple observation motivates the following definition of the tracking time of the pair (F, A). 



If only F and A are given (with these properties) then we will refer to (F, A) as a graph structure pair. 
13 ln other words, il)(u)%l>{v) e E(G m ) whenever uv € E(F), and ip{u)%l>(v) e 0(G m ) whenever uv e O(F). 
14 Since we will only be interested in induced sub-structures, we shall identify each sub-structure of F with 
its vertex set. Thus we write A C H C F to indicate that H is a sub-structure of F, and A C V{H). 
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Definition 2.10. Define 15 

t* A (F) = inf jt > : N A (F)(m) < (2t) e(F) } e [0, oo] (10) 

and 

^(F) = min | min {^(#) : A C if C F}, t* }. 
We call t A (F) the tracking time of the pair (F, A). 

Finally, for each graph structure pair (F, A) with tA(F) > 0, define 

c = c(F,A) := maxi max j- — ?°[ H > /rr . \, 2 I, (11) 
V ; \ achcf \ 2v A (H) — e(if) J J ' v ; 

so in particular e c * 2 = n 1 / 4 when t = if t A (F) < t*. 16 

We can now state the 'permissible graph structure theorem'. 

Theorem 2.11. For every permissible graph structure F, and every independent set A C 
V(F), there exists a constant j(F,A) > such that, with high probability, 

/ n , \ e(F) 

N<p (F)(m) e (l ± eV/*(k)g n p)) (e- 4t2 )° (F) f ^=J 

for every u> < t ^ ^(-P 1 ), every faithful <fi: A — > V(G m ). 

We emphasize that when t > t A {F) then N^F) is no longer likely to be tracking (in fact, 
it is quite likely to be zero). Nevertheless, in this case we shall still give an upper bound on 
N^{F) which is (probably) best possible up to a polylog factor, see Theorem 4.1. As noted 
earlier, this latter bound will be a crucial tool in the proof of Theorem 2.11; more precisely, 
we shall use it to bound the maximum step-size in our super- and sub-martingales. We note 
also that we shall need a separate argument, which is based on the proof in [10], to control 
the variables N^(F) in the range < t ^ uj. 

Finally, in order to give an upper bound on the number of edges in G n> A, and to show 
that it is a good Ramsey graph, we shall need to prove the following theorem. 

Theorem 2.12. 

A(G nA ) = (^= + o(l)^jy/n\ogn and a(G nA ) ^ (y/2 + o(l)) v^logn 

with high probability as n — >■ oo. 

The basic idea of the proof of Theorem 2.12 is to sum over sets S (of the appropriate size) 
the probability that S is an independent set in G m * / the neighbourhood of a vertex in G nA . 
In order to bound this probability we shall need to track the number of open edges inside S, 
which will be possible (via our usual martingale technique, see Section 3) only if there are 
not too many vertices which send more than n 5 (for some < 5 e) edges of G m * into S. 

15 Notc that t* A (F) = iff e(F) ^ 2v A (F), and that t A (F) = t* if A = F. 

16 Thus the first graph structure pair in Figure 2.2 has tracking time tA(F) — j^V^°S n an d c(F, A) = 3, 
and the second has tracking time tA(F) = 0. 
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The main challenge of the proof turns out to be dealing with the other case (S has a large 
intersection with many neighbourhoods), see Section 7. We remark that, by comparison 
with G(n, m), it seems very likely that our upper bound on o^G^a) is asymptotically tight; 
however, we leave a proof of this statement as an open problem. 

3. Martingale bounds: the Line of Peril and the Line of Death 

In this section we introduce the method we shall use to bound the probability that we lose 
control of a 'self-correcting' random variable. As in [10, 13], we shall use martingales, but 
the technique here differs in some crucial respects. Our main tool is the following martingale 
bound (see [41, Theorem 3.15], or the Appendix). We remark that, for all of the martingales 
we shall use below, M(m) will depend only on the graph G m+r for some r G N. 

Lemma 3.1. Let M be a super-martingale, defined on [0,s], such that 

\AM(m)\^a and E[\AM(m)\] ^ p (12) 

for every m G [0, s — 1]. Then, for every ^ x ^ (3s, 



P(M(s) > M(0) +x) ^ exp 



x 2 



4a[3s 



For some of the more straightforward martingale calculations (for example, when t ^ u), 
it will be more convenient to use instead Bohman's method. We therefore state here, for 
convenience and comparison, the martingale lemma used in [10]. 

Lemma 3.2. Let M be a super-martingale, defined on [0,s], such that 

-(5 ^ AM(ra) ^ a 

for every m G [0, s — 1]. Then, for every ^ x ^ min{o;, (3} ■ s, 



P(M(s) > M(0) +x) ^ exp 



x 2 



Aaf3s / 

We remark that in both cases, a corresponding bound also holds for sub-martingales. 

3.1. The Line of Peril and the Line of Death. Let A[m) be a 'self-correcting' (see 
Definition 3.3, below) random variable which we wish to track, i.e., show that 

A(m) G (l± g(t))A(m) 

for some functions A[m) and g(t), and for all m in a given 17 interval [a, b). Let us define the 
normalized error to be 

A*(m) = A(m) T i(m) . 

g(t)A(m) 

In order to motivate the rather technical statement below (see Lemma 3.5), we shall begin 
by outlining the basic idea behind the method, which is in fact rather simple. 



17 A typical case will be a — lo ■ n 3 / 2 and b = t^F) ■ n 3 / 2 . We will need to assume that a ^ n 3 / 2 . 
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The basic idea: Suppose that A* is a random walk on R, and we wish to bound the 
probability that the event 

(\A*(m)\ > 1) H/C(m) 

holds for some m e [a, b], where /C(l) D ... D fC(m*) is some sequence of 'good' events 18 . 
When |A*(m)| < 1/2 we need only that the maximum step-size is o(l). When |A*(m)| ^ 1/2, 
on the other hand, suppose that A* has a 'self-correcting drift' (in expectation) back towards 
the line A*(m) = 0. The idea is as follows: do nothing as long as |A*(m)| < 1/2, and step 
in only when we reach the danger zone. More precisely, we control the probability that A* 
'crosses' from |A*(m)| = 1/2 (the 'Line of Peril') to |A*(m)| = 1 (the 'Line of Death') in 
a given interval, [r, r + s]. There are two cases to consider: either A* crosses this interval 
quickly, which is unlikely even without the drift, or A* crosses slowly, which is unlikely since 
it has to constantly swim against the current. In both cases we apply Lemma 3.1 to a 
suitable super- martingale, which give bounds of the form n~ logn for well-chosen g(t). 

Before giving a more detailed description of our martingale method, we remark that, in 
order to prove the various theorems stated in Section 2, we shall consider a polynomial 
number of 'bad' events of the following form: 

"Variable A* is the first of all the variables that we are tracking to cross its Line of 
Death, it does so after r + s steps, and it last crossed its Line of Peril after r steps." 

Clearly, if some variable crosses its Line of Death, then an event of this type must occur. 
Since we prove bounds of the form 

n -logn Qn 

the probability of each event, we will be able 
to deduce, by the union bound, that with high probability none of the bad events occurs. 




LoD 

Figure 3.1. The Lines of Peril and Death. 



'For us, the event K{m) will essentially say that all of our variables are still tracking at step m. 
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In order to make precise the general strategy outlined above for a variable A*, we shall 
(typically) do the following: 

1. Choose a sequence of 'good' events /C(l) D . . . D /C(m*). 

2. Prove that if K,(m) holds then 19 

E[AA*(mj\ e (c±d)--^(-A*(m)±e > ) (13) 

for some constants c > d ^ 0, and moreover 

\AA*(m)\ < a(*) and E[|AA*(m)|] < (3(t). 

3. Call the line |A*(m)| = \ the Lme of Peril and |A*(m)| = 1 the Line of Death. 

4. For each pair (r, s) G N 2 , consider the event £ A (r, s) that |A*(m)| crosses the Line of 
Death (for the first time during the interval [a,b]) in step r + s, and that it crossed 
the Line of Peril for the last time (before step r + s) in step r + 1. 

5. Set s = min{s,n 3 /r}, let a and j3 denote the suprema of a(t) and (3(t), respectively, 
over the interval [r, r + s ] , and check that the inequality 

(logn)" 3 

holds. 20 

6. Define a function M(m) on [r, r + s ] by setting 

cr 

M(m) = |A*(m)| + (m-r)-— , 

and observe that, while K,(m) holds, M is a super-martingale, by (13). We consider 
separately the cases s = s and s — ^ 3 / r ) noting that in either case s ^ n 3 / 2 . 

7. In the former case, we ignore the self-correction and, using Lemma 3.1, bound the 
probability of C A (r, s) D /C(r + s) n {s = s} by 

P (V( S ) > M(0) + < exp (-^) < n-^, 

for some 5 = 5(c) > 0. The final inequality holds by our bound on a(3n 3 ^ 2 . 

8. In the latter case, we use the self-correction of A*{m) to bound the probability that 
the martingale hasn't re-crossed the Line of Peril after sq steps. Indeed, it follows 
from \A*(r + s )\ > 1/2 > \A*(r)\ that M(s ) ^ M(0) + c/4, due to the 'drift' over the 
interval [r, r + s ]. Thus, by Lemma 3.1 and as above, we can bound the probability 
of the event C A (r, s) D /C(r + s ) n {s 7^ s} by 

P (M( So ) > M(0) + |) < exp ("^) < n""*", 

for some 5 = 5(c) > 0, as required. 



19 We write x G (a±b)(c±d) to denote the fact that x = yz for some y £ a±b and z e c±d. 
20 In fact, in order to apply Lemma 3.1 we also need to check that /3s ^ 1, see below. 
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We remark that, crucially, our functions a(t) and (3{t) in each application will vary by only 
a relatively small factor over the range to G [r, r + sq] . Hence we shall not lose much in 
applying Lemma 3.1 with a and (3 fixed, and equal to the maximum of a(t) and j3(t) over 
this interval. 

3.2. A general lemma. In this subsection we shall prove the main lemma of this section, 
which we shall use several times in the proofs that follow. In order to state it, let us first 
make precise what it means for a random variable to be self-correcting. 

Definition 3.3 (Self-correcting). Let A(m) be a random variable, let g(t) and h(t) be 
functions defined on (0, f ), let /C(l) D . . . D /C(to*) be a nested sequence of events, and let 
n 3//2 < a < b < to*. We say that the variable A is (g, h; IC)- self -correcting on the interval 
[a, b] relative to the function A(m) if the following holds for every a < to ^ b: 
If /C(m) holds, then the normalized error 

A*(m) = A{m) Z A{m) (14) 
g(t)A(m) 

satisfies the following inequalities: 

A*(m) > 1/2 E[AA*(to)] < -/i(f) and A*(m) < -1/2 =4> E[AA*(m)] > 

We remark that, in all of the cases which we shall need to consider, hit) = 0(t • n -3 / 2 ) 
and moreover A*(m) satisfies an inequality of the form 21 

E[AA*(m)] e (c±d) ■ -i^-A*(m)±e) (15) 

when K,(m) holds, for some constants c > d ^ 0. It follows easily from the bounds in (15) 
that A is (g, h; /C)-self-correcting. Let us say that a function a(t) is X-slow on [a, b] if it 
varies by at most a factor of A across any interval of the form x ^ t ^ x + | with a ^ x ^ b. 
Note in particular that any function of the form t k e a2 is A-slow for some A = X(k,£) > 0. 

We are almost ready to state our main martingale lemma; the following definition will 
simplify the statement considerably. 

Definition 3.4. We say that a collection (A, 5; g, h; a, f3; IC, I) is reasonable if 

• A > 1 and 5 G (0, 1/4) are constants, • g, h, a and (5 are functions defined on (0, £*), 

• I = [a,b] C [n 3 / 2 , to*] is an interval, • /C(l) D . . . D /C(to*) is a sequence of events, 

and moreover they satisfy the following conditions: 
(a) a and (3 are A-slow on /. 



(b) min {«(£), /3(t), M^)} ^ "§7^ an( i < ^ 2 f° r every to G [a, 6]. 



21 



We write x e (a ± 6)(c ± rf) to denote the fact that x — yz for some y £ a±b and z€c±i 
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We can now state the main result of this section. 

Lemma 3.5. Let (\,5; g, h; a, (3; JC, I) be a reasonable collection, and let A be a random 
variable which is (g, h; JC)- self -correcting on I = [a,b] relative to the function A. Suppose 
that 

\AA*(m)\ < a(t) and E[|AA*(m)|] < p(t) 
for every m G [a, b] for which K{m) holds, and that \A*{a)\ < 1/2. Then 

P^(|A*(m)| > l) fl /C(m — 1) for some m G [a, b] j ^ n 4 exp ( ^ min 



n 



3 /2 t a(t)p(t) J ' 



where 5' = 5 3 /32\ 2 > 0. 

Proof. For each (r, s) G N 2 , we define an event 

r+s-l 

£i(r,s) = {A*(r) < 1/2} n f| {1/2 < A*(m) < 1} n {A*{r + s) > l}, 

m=r+l 

and a corresponding event Cj_(r,s), by replacing the interval [1/2,1] with [—1/2,-1]. If 
| -A* (a) | < 1/2 and |A*(m)| > 1 for some m G [a, b], then the event C+(r,s) U C A (r,s) must 
hold 22 for some a ^ r < m and 1 ^ s ^ m — r. By symmetry, it will therefore suffice to 
prove the following claim. 

Claim: P(c$(r, s) n /C(r + s - 1)) < exp I — 2 mm — J for every r, s 6 N. 

Proof of claim. Set s — min{s,n 3 /r}, and define a function M on the interval [r, r + s ], 
such that M(m) depends only on G m , as follows: 

5 ■ r(m — r) 



M{m) = A*(m) + 



n 3 



We claim that M{m) is a super-martingale on [r + 1, r + s ] as long as A*(m) > 1/2. Indeed, 
since the function A is (g, h; /C)-self-correcting on [a, b] with respect to A, we have 

E[AM(m)] = E[AA*(m)] + — < + < 0, 

since A*(m) > 1/2 and /C(m) together imply that E[AA*(m)] < —h(t). 
In order to apply Lemma 3.1, set 

a — 2- max and /3 = — + 2 • max 

r<m^r+so So r<m^r+so 

and observe that (12) holds, since |AM(m)| ^ |AA*(m)| + 5r/n 3 ^ 2 • a(t), and similarly 
E[|AM(ra)|] < E[|AA*(m)|] + 5r/n 3 < 2 • f3(t), and moreover /3 • s > 1. 
We claim that M(r + s ) > M(r + 1) + 5/2. Indeed, if s = s < n 3 /r then 

(5 • TS 

M(r + s ) = A*(r + s) + — g- > 1 > M(r) + 1/2 > M(r + 1) + 1/4, 



22 



Here we used our assumption that a(t) < 5 2 for every m e [a, 6]. 
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since A*(r + s) > 1, M(r) < 1/2, \AM(r)\ ^ 25 2 and 5 < 1/4. On the other hand, if 
So = n 3 /r, then 

M(r + s ) = + s ) + > - + 5 > M(r) + 5 > M(r + 1) + -, 

2 2 

since A*(r + s ) > 1/2, M(r) < 1/2, |AM(r)| < 25 2 and 5 < 1/4. Hence, by Lemma 3.1, in 
either case we have 

p(/#r,.)nK<r + .-!)) < «p (-jjgj;.) . (16) 

Finally, observe that if /3s ^ 2 then /3 < 4A/9(f), and otherwise 5t • /3s < 25t < 2/3(t)n 3/2 . 
Since s < ^ 3 /r < n 3 /a < n 3//2 , it follows that 

a/3s ^ a • max |4A/3(t)n 3 / 2 , 2/3 j < ^ • a(t)/3(t)n 3 / 2 , (17) 

for every r < m ^ r + sq, since a(t) and are A-slow on [a,b] and s ^ n 3 /r, so each 
varies by at most a factor of A. Combining (16) and (17), the claim follows. □ 

It follows immediately from the claim that 

b x 

U (\A*(m)\ > l)n/C(m-l) < ^p(/>V,s)n/C(r + s-l)) 



r,5 

< n exp — mm 



n 3 / 2 t a(t)fj(t) J ' 

where 5' = <5 3 /32A 2 , as required. □ 
We note that the minimum in the lemma above is, of course, over the interval a ^ m ^ b. 



3.3. The events X(m), y(m), Z(m) and Q(m). To finish this section, we shall motivate 
the martingale technique introduced above by using it to track the variables X e . This is a 
particularly simple special case of the argument used in Section 4, and should help prepare the 
reader for the more involved application performed there, where we use it to control N^(F). 

In order to track the variables X e , we shall need to introduce some more notation, which 
will be used in a similar way throughout the paper. Set 

f y (t) = e^V^Qogn) 5 / 2 (18) 

and f x (t) = e~ 4 ' 2 f y (t), where C = C(e) > is the large constant chosen above. Recall that 
g y (t) = e 2t n _1//4 (logn) 4 , and set g x (t) = Cg y (t). The following simple definitions will play 
a crucial role in the proof of Theorems 1.1 and 1.2. 
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Definition 3.6. For each < to' ^ to*, we define events X(m') : y(m!) and Q(to') as follows: 
(a) X(m') denotes the event that 

X e {m) G X(m) ± f x {t)X{n z ' 2 ) 

for every open edge e G 0(G m ), and every to ^ min • n 3 / 2 , to'}, and 

X e (m) G (l±^(t))X(TO) 

for every open edge e G 0(G m ), and every to ■ n 3 / 2 < to ^ to'. 
(6) 3^(^') denotes the event that 

Y e (m) G Y(m) ± /,(t)F(n 3 / 2 ) 

for every open edge e G 0(G m ), and every to ^ min • n 3//2 , to'}, and 

Y e (m) G (l±^(t))f(TO) 

for every open edge e G 0(G m ), and every to ■ n 3 / 2 < to ^ to'. 
(c) Q(to') denotes the event that 

Q(to) g Q(m)±e- f y (t)Q(n 3/2 ) 

for every to ^ min ■ n 3 / 2 , to'}, and 

X(to) - / v Y(m) „ , s , Q(m) „ . . 
^Hel±ff,(0. ^Hel±# and ^Gl±y,(t) 
A(m) y (to) Q(to) 

for every w • n 3//2 < to ^ to'. 

Similarly, we shall write £{m') to denote the event that the conclusion of Theorem 4.1, 
below, holds for all to ^ to' (see Section 4). In this section, we shall only need the following 
two special cases of that event: 

1. In a slight abuse of notation, if / = {u, v} G E(K n ), then let us write Yf(m) for the 
number of vertices w G V(G m ) such that {u,w} G 0(G m ) and {v,w} G E(G m ), or 
vice-versa, even if / ^ 0(G m ). The event £{m!) implies that, for every / G E(K n ), 

Y f (m) ^ (l + e)y(m) 

if cu • n 3 / 2 < to ^ to', and moreover that Yf(m) ^ if m ^ min{u; • n 3,/2 ,TO'}. 

2. Let (W, A) denote the graph structure pair with v(W) = 4, ^(W) = 1, e(W) = 1 
and o(W) = 2. The event £{m') implies that 

N ( p{W){m) ^ max {4te~ 8t y/n, {lognf} 

for every u> ■ n 3//2 < to ^ to' and every faithful map 0: A — > V(G m ). 
Finally, given e = {u,v} G E(K n ), set 

W = |^ G J«)n7V G „>)|, 

and let Z(m!) denote the event that Z e (m) ^ (logn) 2 for every e G E(K n ) and every to ^ to'. 
We shall use the following bound frequently throughout the proof of Theorem 1.1. 
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Proposition 3.7. Let m G [to*]. With probability at least 1 — n ~ cl ° sn , either the event 
(£(m - 1) n Q(to - l)) c holds, or 

Z e (m) < (logn) 2 

for every e G E(K n ). In other words, 

P(s(m - 1) n Z(m) c n Q(to - 1)) < n~ clogn . 

We would like to emphasize the appearance of the event £(m) in the statement above, 
and the absence of y(m). This is because the event y(m) only gives us a bound on Y e (m) 
for open edges e G 0(G m ), whereas we shall require a bound on the number of (edge, open 
edge) pairs which form a triangle with e for both open and non-open pairs in (^) \ E(G rn ). 
As noted above, the event £(m) gives us such a bound on Y e (m) of the form (l + e)Y{m) 
(see Section 4 for the details). Although the error term in this bound is larger than that in 
Theorem 2.5, it easily suffices for our current purposes. 

Proof of Proposition 3. 7. We imitate the proof of the corresponding statement from [10]. Let 
to' G [m*], and let e = {u, v} G E(K n ). As noted above, if the event £{m') holds, then 

Y e (m) < (1 + e)4te~ 4t2 Vn (19) 

for every to ■ n 3 / 2 < m ^ to', and Yf(m) ^ for every to ^ min{cu • n 3 / 2 ,m'}. Now, note 
that ^ AZ e (m) ^ 1 (deterministically), and that, if £{m) n Q(to) holds, then 

P(Z. (n , + 1 )>Z.( m )) = gg<* = , 
where the second inequality follows from (19) and the event Q(m). It follows that 

p(e(m' - 1) n Z(m'f n Q(m' - 1)) < Q (^j.) Q?)^ < ^ 
for every to' ^ to*, as required. □ 

3.4. Tracking X e . As a concrete example to aid the reader's understanding, we shall now 
show how to track the variables X e ; in order to avoid unnecessary distractions, we postpone 
a couple of the (straightforward, but somewhat technical) calculations to the Appendix. 
Set a = to ■ n 3 / 2 and define, for each to G [m*], 

K x (m) = £(m) n X(a) n y(m) n Q(m). 

We shall prove the following bounds on X e (m), which are slightly stronger than those given 
by Theorem 2.11. We shall need exactly this strengthening in Section 5. 

Proposition 3.8. Let to ■ n 3 / 2 < to ^ m* . With probability at least 1 — n ~ clogn either 
K x {m — l) c holds, or 

X e (m) G (liCe^V^Qogn) 4 ) • 2e" 8 ' 2 n = (l ± g x (t))X(m) (20) 

for every open edge e G 0{G m ). 
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The first step in tracking a variable will always be as follows: we define the normalized 
error, in this case 

= X e (m)-X(m) 
g x (t)X(m) 

and show that, while everything is still tracking, it is self-correcting. More precisely, we 
prove the following lemma. 

Lemma 3.9. Let u ■ n 3/2 < m < m* . If £{m) n X(m) H y(m) n Q(m) holds, then 

At / \ 

e[ax;h] G -^(-x:(m)± e ) 

/or every e G 0(G m ). 

Observe (or see the Appendix) that the evolution of the variable X e is controlled by the 
following equation: 

E[AX e (m)} = -±- £ (Y f (m) + l). (21) 
Vl ' fex e (m) 

Lemma 3.9 follows from (21) via a straightforward, but somewhat technical calculation, and 
is also a special case 23 of Lemma 4.18, which will be proved in the next section; however, for 
completeness we provide a proof in the Appendix. The key idea is that, since g x (t) = C-g y (t) 
and C — C(e) is large, the error in our knowledge of the variables Yf is small compared with 
the error we allow in X e . 

Our next lemma bounds \AX e (m)\. Although simple, our application of it in the martin- 
gale bound is one of the key new ideas in this paper. 

Lemma 3.10. Let u ■ n 3//2 < m ^ m* . If S(m) holds, then 

\AX e (m)\ < max {4te~ 8t2 y/n, (lognT} 

for every e G 0(G m ). 

Proof. The key observation is that, if edge / is chosen in step m + 1 of the triangle- free 
process, then |AA e (m)| = X e (m) — X e (m + 1) is bounded by one of the variables controlled 
by the event £{m). More precisely, note that if an open triangle T containing e is destroyed 
by /, then / must close one of the open edges of T. If more than one such open triangle is 
destroyed, then we must have |e D /| = 1, in which case the number of open triangles which 
are destroyed is exactly N^iW)^), where lm(0) = e U /. As noted above, the event £{m) 
implies that 

\N t f ) (W)(m)\ < max {4te- 8 * 2 v^, (logn)"} 
for every faithful 0, as required. □ 



23 Actually, this lemma is a slight strengthening, since in the case of X e we do not need the event Z(m). 
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Finally, using the two lemmas above, we can easily bound \AX*(m)\ and K[\AX*(m) 
Lemma 3.11. Let u ■ n 3 / 2 < m < m* . If £{m) fl X{m) fl y{m) fl Q(m) holds, then 



\AX:(m)\ < ^- max {for* 2 ^, (log „)«} and E[|AX e *(m)|] < ■ ^ 

/or every e G 0(G m ). 

We are now ready to prove Proposition 3.8. 

Proof of Proposition 3.8. We begin by choosing a family of parameters as in Definition 3.4. 
Set K{m) = K x (m) n X(m) and I = [a, b] = [cu ■ n 3 / 2 , m% and let 

a t = —r max {te- 8t yfii, logn "} and 0(t) = — - • 

^(t) n g x (t) n^l 1 

Moreover, set A = C, 5 = e and h(t) = t ■ n _3//2 . We claim that (A, 5; g x , h; a, f3; K) is a 
reasonable collection, and that X e satisfies the conditions of Lemma 3.5 if e G 0(G m ). 
To prove the first statement, we need to show that a and j3 are A-slow, and that 

min {«(*), 0(f), fc(f)} > ^ 

and a(t) ^ e 2 for every uj < t ^ t*, each of which is obvious, since g x (t) ^ 1 for all t ^ t*. To 
prove the second, we need to show that X e is (g x , h; /C)-self-correcting, which follows from 
Lemma 3.9, and that, for every to ■ n 3//2 < m ^ m*, if /C(m) holds then 

|AX e *(m)| < a(t) and E[|AX e *(m)|] ^ f3(t), 

which follows from Lemma 3.11. Note also that the bound |X*(a)| < 1/2 follows from the 
event X(a), since f x (u)X(n 3 / 2 ) <C g x (u)X(a) if oo(n) — > oo sufficiently slowly. 
Finally, observe that 

e 4 ' 2 - 1 

a(t)f3(t)n 3 ' 2 <: -• max {*e- 8t2 v^, (log n)"} < 

^■(logn) 9 1 ' (log n) 6 

for every u < t ^ £*, since e 4 * 2 ^ n l l 2 ~ £ . By Lemma 3.5, and summing over edges e G E(K n ) 
the probability that e G 0(G m ) and X*(m) > 1, it follows that 

F(x{m) c n /C(m - 1) for some m G [a, 6]) < n 6 exp ( - 5'(logn) 3 ) ^ n- clogri , 

as required. □ 



4. Tracking Everything Else 

In this section we shall generalize the method introduced in Section 3 in order to prove 
Theorem 2.11 under the assumption that the variables Y e , Z e and Q are tracking. In other 
words, as long as the events y(m), Z(m) and Q(m) all hold, we shall give close to best 
possible bounds on the number of copies of an arbitrary graph structure F in G m . 



24 



GONZALO FIZ PONTIVEROS, SIMON GRIFFITHS, AND ROBERT MORRIS 



We begin by recalling that a graph structure triple (F, A, 0) consists of a permissible graph 
structure F, an independent set A C V(F), and an injective map 0: A — > V{G m ). Recall 
that, given such a triple, we set 



/ 9 . \ e(F) 

N A (F)(m) = (e~*y {F) (Aj n^ F \ (22) 

and observe that this is roughly the expected number of (labelled) copies of F rooted at 
(f)(A) in the Erdos-Renyi random graph G(n,m). Recall that t* = — e) ^/logn, and that 
the tracking time of the pair (F, A) is defined to be 

t A (F) = min {min {t* A (H) : A C H C F}, t* } , 

where ^(if) is defined so that N A (H)(m) = (2t)< H ^ at t = ^(if). We shall track N^F) 
up to time t A (F), or until is no longer faithful, for every graph structure triple (F,A,(j>). 
In other words, we shall control the number of (rooted) copies of every permissible graph 
structure, for every root <fi(A), and for (essentially) as long as it is possible to do so. 
Recall that if is faithful, then 

A^(F)(m) = |{V>: V(F) — > V(G m ) : -0 is an injective homomorphism and ip\ A = 0j 

denotes the number of labelled copies of F in G m which agree with on A, and that 
iV^(.F)(m) = N^F^m — 1) otherwise; as noted in Section 2, we shall often suppress the 
dependence of N$(F) on m. We shall also occasionally write N^(F) for the collection of 
copies as above, and trust that this will not cause confusion. Recall that C = C(e) > 0, 
chosen earlier, is a sufficiently large constant. Now define 



A(F, A) = {C 3 v A (F) 2 + 2e(F) + o(F)f (23) 
and, for each A C H C F, set 

A(F, H, A) = A(F, H) + A(H, A) and ^(F, A) = A(F, A) — e(F) — 2. (24) 

Define 

f FA (t) = eW+i)(' a +i) n -i/4(i ogJl )A(^)- v / A(^ i (25) 
and if t^(F) > 0, then set 

^(t) = e rf2 n- 1 /4 (logn) 7(^) ; (26) 

where c = c(F, A) is chosen (as defined in (11)) so that e c * 2 = n 1 / 4 when t = t A (F) if 
t^(F) < £*, and so that c(F,A) ^ 2. We shall bound N^F) up to a multiplicative factor 
of (l ± gF,A(t)) if u < t ^ t A (F), and up to an additive factor of f FtA (t)N A (F)(n 3/2 ) if 
t ^ u < t A (F). Finally, set 

/C £ (m) = y(m) n Z(m) n Q(m) 
for each m ^ to*. As noted earlier, we assume throughout that n G N is sufficiently large. 
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The following theorem is the main result of this section. It says that we can track all 
graph structures up to their tracking time, and that after this time we can bound them up 
to a poly-log factor. 

Theorem 4.1. Let (F,A) be a graph structure pair with no isolated vertices 24 , and let to e 
[m*]. Then, with probability at least 1 — n~ 2logn , either the event K. (to — l) c holds, or the 
following holds for every faithful injection 0: A — > V{G m ): 

(a) IfO<t^uj< t A (F), then 

N^F)(m) e N A {F){m)±f FA {t)N A {F){n^ 2 ). 

(b) J/w < i < t A (F), then 

N^(F)(m) e (l±g F;A (t))N A (F)(m). 

(c) Ift > t A (F), then 

A^(F)(to) ^ (\ogn) A ^ H ' A) N H (F)(m + ), 
where A C H C F is minimal such that t < tu(F), and m + = max{m,n 3 / 2 }. 

We emphasize that the theorem holds for all structures F, not just for those of bounded 
size. However, the statement becomes trivial if F has more than about (logn) 1//c vertices 
(outside A), edges, or open edges, since then both (logn) 7 ^'^ and (\ogn) A ( F ' H ' A ^ become 
larger 25 than the total possible number of labelled copies of F in G m . We shall therefore 
assume throughout this section that v A (F) + e(F) + o(F) ^ (logn) 1 / 5 . In particular, this 
implies that either t A (F) >wor t A (F) = 0, see Observation 4.16 below. 

Using the assumption that F has no isolated vertices, it follows that \A\ ^ (logn) 1 / 5 . 
This assumption will be useful later on, since it will allow us to bound the number of graph 
structure triples (F, A, 0) in our applications of the union bound. However, this is the only 
point at which we will need to assume that F does not have isolated vertices; indeed, it is 
easy to deduce a bound for general F, since the isolated vertices in A impose no restrictions 
(except that their vertices cannot be reused), and because we may embed the remaining 
isolated vertices arbitrarily. If v(F) is small, then we have about n choices for each. 

In what follows, we'll need to assume frequently that the conclusion of the theorem holds 
at all earlier times, so let £{m') denote the event that (a), (6) and (c) all hold for every 
graph structure triple (F, A, 0) with no isolated vertices, and for every m ^ m! such that 
the map is faithful at time t. Thus, we are required to prove that 

P(£(m) c n y(m - 1) n Z(m - 1) n Q(m - 1)) ^ n - 2logn 

for every m ^ to*. 

24 Thc condition that F has no isolated vertices is necessary in order to disallow the possibility that A is 
extremely large, in which case N^F) ^ (n — |^4|) t ' A ( F ), and so the conclusion of the theorem does not hold. 
However, one can easily deduce bounds for an arbitrary triple (F, A, <f)), see the discussion after the theorem. 

25 More precisely if (\ogn)^ F ' A ^ ^ n ^(F)+ e (F)+i thcn f FA ( t )N A (F)(n 3 / 2 ) > n VA ^ for every t ^ u, and 
also g F ,A(t)NA{F)(m) > n VA ^ for every cj < t < t A (F), since N A (F)(m) > 1 for every such t. 
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Remark 4.2. In fact, the event £{m) implies that N ( j ) (F)(m) satisfies the conclusion of 
Theorem 4.1 (i.e., either (a), (b) or (c)) for every graph structure pair with v(F) = n°( l \ 
every m G [m*] and every faithful injection (f>: A — > V(G m ), see the Appendix. 

The proof of Theorem 4.1 is roughly as follows. Proving that part (a) is not the first to 
go wrong is the most straightforward, and follows by Bohman's method from [10], suitably 
generalized (see Section 4.6). For part (6), we shall count the expected number of copies 
of F which are created or destroyed in a single step of the process, and deduce that N^(F) 
exhibits a certain self-correction, of the type considered in Section 3. We also analyse the 
maximum possible number of copies of F rooted at <f)(A) which can be created or destroyed 
in a single step. Together with the method outlined in Section 3, this will be enough to 
deduce that part (b) of the theorem does not go wrong before part (c). 

In order to prove part (c), we break up the structure F into its building sequence 

A C H C • • • C H e = F, 

which has the property that the graph structure pair (H i+ i,Hj) is balanced 26 for every 
< % < I, see Section 4.1. Since A(F,H,A) is super-additive with respect to this sequence, 
this will allow us (see Section 4.7) to restrict our attention to balanced pairs (F,A). The 
bound in (c) at time t = tA{F) follows for such structures by part (b); in order to prove it 
for larger values of t we apply a slight variation of our martingale method. 27 

Finally, we note that the proof of Theorem 4.1 is extremely long and technical. We 
encourage the reader who wishes to avoid getting bogged down in such technical detail 
to skip forward to Section 5, and reassure him that the remaining sections are essentially 
independent of this one. The one crucial thing that he will need in order to follow the proofs 
in Sections 5 and 7 is the definition of the event £(m). 

4.1. Building sequences. We begin the proof by introducing our key tool for controlling 
unbalanced graph structures. The following crucial definition allows us to break the process 
into a sequence of balanced steps, each of which we can control more easily. 

Definition 4.3 (Building sequences). A building sequence of a pair (F,A), where F is a 
graph structure and A C V(F) is an arbitrary subset 28 of the vertices of F, is a collection 

A C H C • • • C H e = F 

of induced graph sub-structures of F such that the following conditions hold: 

• H Q is maximal over structures with t* A (H ) = and Na{H ) at t = 1/2 minimal. 

• Hj + i is maximal over structures HjC-H'ZF with t* H .(Hj+i) minimal. 



For graph structure pairs (F, A) with tA(F) > 0, we shall say that the pair (F, A) is balanced if and only 
if t* A (H) t* A (F) > for every A C H C F. If t^(F) = then the definition is slightly more complicated, 
see Section 4.1. 

27 In fact, in Section 4.4 we shall prove a slight strengthening of this bound for balanced pairs (F, A). We 
shall need this strengthening in order to bound \AN^(F)(m)\ when t ^ tA(F), see Section 4.5. 

28 Note that the edges and open edges of F[A] do not affect the definition, so the reader can think of A as 
being an independent set, as usual. 
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Given such a sequence, we define t = and tj = t* Hj l (Hj) for each j e [■£]. 

We begin by stating the various properties of building sequences which we shall prove in 
this subsection; we assume throughout that v A (F) + e(F) + o(F) < (log n) 1/5 . Our first 
lemma shows that building sequences exist, and are unique. 

Lemma 4.4. Every graph structure pair (F, A) has a unique building sequence. 

The times tj are not only well-defined, they are increasing. This fact will also follow from 
the proof of Lemma 4.4. 

Lemma 4.5. For every graph structure pair (F, A), we have = to < t\ < ■ • • < ti ^ oo. 

Lemmas 4.4 and 4.5 allow us to describe the graph structure in part (c) of Theorem 4.1. 

Lemma 4.6. Let < t < t* , and let ^ j ^ I be such that tj ^ t < tj +1 , where tg+i = oo. 
Then Hj is the minimal A C H C F such that t < tji(F). 

The next lemma gives an alternative, and perhaps more intuitive characterization of the 
time te- We remark that if t < then the conclusion of the lemma does not hold. 29 

Lemma 4.7. Let (F,A) be a graph structure pair with building sequence A C H C • • • C 
H e = F. Ift^ te, then 

N H (F)(m) ^ (20 e(F)_e(if) 

for every A C H C F . 

We now make an important definition, already mentioned above. 

Definition 4.8 (Balanced graph structure pairs). A graph structure pair (F,A) is said to 
be balanced if the building sequence of (F, A) is either 

ACH = F or A = H C H l = F. 

Thus (F, A) is balanced if and only if one of the following holds: 

(a) t* A (H) > t* A (F) for every A C // C F, and > 0. 

0) e(//) - 2w A (iJ) < e(F) - 2v A {F) for every A C // C F, and = 0. 

It follows from Lemma 4.5 that each pair in the building sequence of each 

(F, A) is balanced. Indeed, we have t* Hi (H) > t* H .(H i+ i) > for every Hi C H C F , by 
Definition 4.3 and Lemma 4.5. The pair (Ho, A) is also balanced; this follows directly from 
the definitions. We record this important property in the following lemma. 

Lemma 4.9. Let (F, A) be a graph structure pair with building sequence A C H C • • • C 
Hn = F. Then the pairs (H , A) and (Hi, -f/j-i) for each 1 ^ i ^ £ are all balanced. 

We shall use Lemma 4.9 in conjunction with the following simple observation to deduce 
bounds on N<f,(F) for unbalanced pairs (F, A) from bounds for balanced pairs. 

29 To see this, simply set H = H^_ 1 . Note also that Lemma 4.7 implies the case j = t of Lemma 4.6. 
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Lemma 4.10. Let (F,A,<f>) be a graph structure triple with building sequence A C H C 
• • • C He = F, and suppose that <fi is faithful at time t. Then, for each A C H C F , 

N+(F)(m) < N^H)(m)- max N^(F)(m), 

<f>':H^V{G m ) 

and hence, for each ^ j ^ £, 

N^F)(m) <: N^H )(m) ■ ( T\ max A^(# i+1 )(m) V max 7VV(F)(m), 

where for each ^ i ^ j £/ie maximum is taken over injective maps fa: V{Hj) — >■ V((jr m ) 
which are faithful at time t. 

For example, we can use this idea to improve (under certain circumstances) our upper 
bound on N^F) for unbalanced pairs (F, A). We will use the following lemma in Section 4.5, 
below, to bound the maximum possible size of \AN^(F)(m)\. 

Lemma 4.11. Let (F,A,(j)) be an unbalanced graph structure triple with building sequence 
A C H C • • • C He = F, and suppose that t ^ tg, and that <fi is faithful at time t. If £{m) 
holds, then 

N^F)(m) <: (\ogn)^ F ' H ^> A \ 

Having stated the main results of this subsection, let us now turn to the proofs. We 
begin with a few straightforward observations, which we shall use on numerous occasions 
throughout the section. 

Observation 4.12. For every graph structure pair (F,A) and every A C H C F, 

N A (H)-N H (F) = N A (F). 

Proof. This follows easily from the definition of N A (F). Indeed, simply note that v A (F) = 
v A (H) + v H (F), e(F H ) = e(F) - e(H) and o(F H ) = o(F) - o(H), and use (22). □ 

Given a graph structure pair (F, A), and two substructures A C H C F and A C H' C F, 
we shall write HUH' and HDH' to denote the substructures of F induced by V(H) UV(H') 
and V(H) n V(H'), respectively. 

Observation 4.13. For any A C H C F and AC H' C F, 

N A (H U if') • 7V A (# n if') < iV A (ff ) • N A (H'). 

Proof. This also follows easily from the definition. Indeed, simply note that 

v A (H U H') + v A (H n H') = v A (H) + v A (H'), 

e(H U H') + e(ff n H') ^ e(H) + e(H') and o(H U if') + o(H n if') > o(ff) + o(ff'). □ 

Note that we do not necessarily have equality in the observation above, since there may be 
edges of F between V(H) and V(H') which are not in either H or H'. The next observation 
follows easily from the previous two. 
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Observation 4.14. For any A C H C F and AC H' C F 7 we /iawe 

7V„(# U if') < N HnH ,(H') and t* H (H U H') ^ t* HnH ,(H'). (27) 

Proo/. By Observation 4.12, we have N A {H U H') = N A (H) ■ Nh{H U H') and N A (H') = 
N A {H n if') • N HnW {H'). By Observation 4.13, it follows that TV^ff U H') < N HnH/ (H') 
for every m e [m*], and hence t* H (H U if') ^ t* HnH ,(H') by Definition 2.10. □ 

Our next observation is slightly more technical. It is also the point at which we use our 
assumption that v A (F) + e(F) + o(F) ^ (logn) 1//5 . 

Observation 4.15. Let (F,A) be a graph structure pair, and let A C H\ C ff 2 C F and 
ACH 3 CH 4 <ZF. 7/0 < ^(#2) < ^3(^4) < oo, tfien 

^ 3 ™ 2 -%(^2) 2 » » e(F)loglogn. 

Proof. This follows easily from the definition of t* A (F). Indeed, it follows from (10) that if 
< t* A (F) < 00, then 

and so if < t* Hl (H 2 ) < t* Hz (H 4 ) < 00, then 

t* H3 m 2 ~ t* Hl (H 2 f > 64 lQ ^ )2 » » e(F)loglogn, 

where the final two inequalities follow since e(F) + o(F) < (logn) 1 / 5 . □ 

Finally, let us note an easy consequence of the previous observation, which was already 
made earlier in the text. 

Observation 4.16. Let (F,A) be a graph structure pair. Ift A (F) > 0, then 

i A (F) > (logn) 1 / 4 >U ;. 
Moreover, o(F)/v A (F) ^ c(F, A) < 2o{F) + 2. 

Proof. Simply apply Observation 4.15 with Hi — if 2 — if 3 — A and A C ff 4 C F arbitrary. 
Since t* A (A) = 0, it follows that t* A (H A ) > (logn) 1/4 for every A C ff 4 C F, and hence 
t A (F) ^> (logn) 1 / 4 , as required. The bounds on c(F,A) follow immediately from (11), 
noting that 2v A (H) > e(if) for every AC H C F, since t A (F) > 0. □ 

We are now ready to prove Lemmas 4.4 and 4.5. 

Proof of Lemmas 4-4 an d 4-5- We begin by showing that the graph structure H is well- 
defined, and that t\ > t Ho (F) > 0. Set 

Ho = {A C if C F : 7V A (if) at i = 1/2 is minimal). 
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We claim that the collection Ho is union-closed, 30 and hence that H = \J H€H ^- Indeed, 
let H, H' e Ho, and recall that 

N A (H U H') ■ N A (H n H') < N A {H) ■ N A (H') (29) 

by Observation 4.13. Moreover, by the minimality of N A (H) = N A (H'), we have 

N A (H U H') > N A {H) and 7V" A (# n H') > 7V A (//') 

at time t = 1/2. It follows that the four terms in (29) are all equal at t — 1/2. In particular, 
we have N A (H U H') = N A (H), and hence H U H' G H , as claimed. 

Now, suppose that t* Ho (H) = for some H C H C F. Then e(if) - e(if ) > 2v Ho (H), 
and so N Ho (H) ^ 1 at time t — 1/2. By Observation 4.12 it follows that 

iV^tf) = N A (H ) ■ N Ho (H) < #a(#o) 

at t = 1/2, which contradicts the maximality of H . Thus t Ho (F) > as claimed. 

Now suppose that we have already constructed A C H C • • • C Hi ^ F in a unique way; 
we claim that there exists a unique ifj C ifj+i C F which is maximal over structures with 
t* H .(H i+ i) minimal, and that ij+i ^ tn^F) > U. The argument is almost the same as that 
above. Indeed, setting 

H i+1 = CJJCF : t* H _(H) is minimal |, 

we make the following claim. 

Claim: "Hj+i is union-closed and ti+i > ti. 

Proof of claim. We first show that tu^F) > ti, i.e., that t* H .(H) > ti for each Hi C H C F. 
When i = this was proved above, so let i ^ 1 and suppose that t* H .(H) ^ £j for some 
Hi C. H C. F. Then, at t = £«, we have 

jV^tf) = NhUH) ■ NhAH) < (20 eW - e ^- l) , 
and so t* Hi i (H) ^ tj, which contradicts the maximality of iJj. Hence t Hi (F) > t i: as claimed. 

Next, let H, H' e Hi+i, and note that we have Nh.(H D if') ^ 1 at £ = t i+1 , since either 
HHH' = H, or ^.(if n if') > > w, by Observation 4.16. It follows that 

N Hi (HUH') <: N Hi (H U H') ■ N Hi (H n H') <: N Hi (H) ■ N Hi (H') <: ^t)< H)+e{H '^ 2em 

at time t = t i+ i, by Observation 4.13 and since t* H (H) = t* H .(H') = t i+1 . If e(H U H') = 
e(H) + e(H')-e(Hi), then it follows that t* H .(HUH') ^ t i+l , and hence that HUH' e H i+1 , 
as required. On the other hand, if e(H U H') > e(H) + e(iJ / ) — e(i?j) then we gain a 
factor of \fn/2t in our application of Observation 4.13. Since y/n 3> (2t) 2e ^ F \ it follows that 
t* H .(H U if) < t i+ i in this case also, as claimed. □ 

The claim implies that iij+i = {JueH- i ^> wmcn completes the proof of the lemma. □ 

We next prove Lemmas 4.6 and 4.7. Both are easy consequences of the following lemma. 

30 Note that if t A {F) > then H = {A}, and so the claim holds trivially. 
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Lemma 4.17. Let (F,A) be a graph structure pair, and let A C H C F. If < j < £ is 

minimal such that Hj %. H , then 

t H (F) < t* H (H U Hj) < f,. 

Proof. We shall in fact prove that 

< t* H (HUHj) < < t r (30) 

The first inequality follows by the definition of tn(F), and the second follows by Observa- 
tion 4.14. In order to prove the third, note that Hj-i C H C\ Hj, and that therefore 

N Hi .M) = N Hj .AH n • N H n Hj (Hj) (31) 
by Observation 4.12, where we set = A. Now, if j = then, by the definition of H , we 
have N A (H n if ) ^ N A (# ) at time t = 1/2. By (31), it follows that N H nH (H ) ^ 1, and 
hence t* HnHo (H ) = 0, as claimed. On the other hand, if j ^ 1 then 

Nn^Hj) = (2t) e W)-e(^-i) an d 7VV,_i(# n if,-) > (2t) e ( HnH i)" e ( H i-i) 
at time t = t,-, since minimizes t* H (Hj) = tj. By (31), it follows that 

N H n Hj (Hj) ^ (2tyW-« HnH * 
at time t = tj, which implies t* HnH (Hj) ^ tj, as claimed. Hence (30) holds, as required. □ 

We next prove Lemma 4.7, which gives a natural alternative definition of the time te, and 
deals with the case j = £ of Lemma 4.6. We will also use it later on, see Section 4.4. 

Proof of Lemma 4-7. Recalling that He = F, and this time setting H_\ = 0, we have 

t 

Nh(F) = UNhuh-^HUHj), (32) 

3=0 

by Observation 4.12. In order to bound N HuHj _ 1 (H U Hj), note that if H U ^ H U Hj 
then Hj % H U -f/j-i- Thus, applying Lemma 4.17 to H U -f/j-i, we obtain 

tHUHj-iiH u Hj) ^ 

and hence 

Nhuh^AH U Hj)(m) < (2t) e (M J )-e(^_ 1 ) 

for every £ ^ tj. Since t ^ te ^ tj for each ^ j ^ £, this (together with (32)) implies that 
N H (F)(m) < (2t) e{F) ~ e{H) for every t > ^, as required. □ 

We can now easily deduce Lemma 4.6. 

Proof of Lemma 4-6. The case j — £ follows by Lemma 4.7, since it implies that t* H (F) ^ te 
for every A C H C F, and we have tp(F) = t*. So let ^ j ^ £ — 1 and suppose 
that tj ^ t < tj +1 . Note first that t Hj (F) = min{t, +1 , t*}, by the definition of ifj+i, so 
t < tHj(F), as required. On the other hand, by Lemma 4.17 we have tjj(F) ^ tj ^ t for 
every A C H C F with 2 Hj, since the t, are increasing, by Lemma 4.5. Hence Hj is the 
minimal A C H C F such that £ < t H (F), as claimed. □ 
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Finally, let us prove Lemma 4.11. 

Proof of Lemma J^.ll. By Lemma 4.5 we have tg-i < tg. Thus, by Lemma 4.10 and the event 
£{m), and using Remark 4.2, we have 

N+{F)(m) ^ N^Ht^im) ■ max N^{F){m) ^ (\ogn) A ^ H ^ +A ^' A \ 

4/:H t _ 1 -^V{G m ) 

as claimed. □ 

4.2. Self-correction. In this section we shall prove that, for each graph structure triple 
(F,A,4>), the random variable N^(F) is self-correcting (in the sense of Section 3) on the 
interval ou < t ^ tA(F), as long as the events £(m), y(m), Z{m) and Q(m) all hold, and 
the map <fi is faithful. This will be a crucial tool in our proof that these variables track the 
functions Na{F) on this interval. We begin by defining N£(F) to be the normalized error in 
N<p(F) at time t, i.e., 

9 F At)N A (F)(m) 

Note that N^F) = (l + g F>A (t)N£(F))N A (F), and that therefore, in order to prove Theo- 
rem 4.1(6), we are required to prove that \N£(F)(m)\ < 1 for every oo < t ^ t A (F). Recall 
that e > denotes a sufficiently small constant which is fixed throughout the proof, and 
that c(F, A) ^ 2 for every graph structure pair (F, A). We shall prove the following lemma. 

Lemma 4.18. Let (F, A, <p) be a graph structure triple, let u < t ^ tA(F), and suppose that 
0: A — > V{G m ) is faithful at time t. If £{m) fl y(m) fl Z{m) fl Q(m) holds, then 

E[AN;(F)(m)] e (c(FA) + e ^y^.(-N;(F)(rn)± 

We begin by calculating the expected change in N^F). We will need the following family 
of graph structures. 

Definition 4.19. Given a permissible graph structure F, we define F° F to be the family of 
(labelled) graph structures F° obtained by changing an edge of F into an open edge. We 
call this (changed) edge F -vulnerable. 

We make a quick observation, which follows immediately from the definition, and which 
we shall use frequently in the proofs below. 

Observation 4.20. Let (F,A) be a graph structure pair. Then \F F \ = e(F), and 

2te 4t2 ■ N A (F°)(m) = ^ ■ N A (F)(m) 

for every F° G T F . 

Proof. Since v A {F) = v A (F°), e(F) = e(F°) + 1 and o(F) = o(F°) - 1, the observation 
follows immediately from the definition (22). □ 
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Recall that we use N^(F) to denote both the collection of copies of F rooted at 4>{A) in 
G m , and the size of this collection. 

Lemma 4.21. Let (F,A,(/>) be a graph structure triple, and suppose that <p is faithful at 
time t, where < t ^ t*. If Z{m) holds, then 

R[AN+(F)] e E E Y f^ + E W) ± o(Ff(\ognyN,(F)\ 

^ ' ^ F'eN^F) feO(F*) F°eF° ' 

Proof. Let e be the edge chosen in step m + 1 of the triangle-free process. A copy of F 
rooted at 4>(A) is created when e is the F-vulnerable edge of a copy of some F° e J 7 ^, rooted 
at 4>(A). Moreover, each copy of F is created by at most one such structure F°. Thus the 
expected number of such copies of F created in a single step is exactly 

y ^£!> ,34) 

Similarly, a copy F* of F rooted at 4>{A) is destroyed when an open edge of that copy is 
closed by the addition of e, which occurs with probability 

U//H e g^y E//(-)±g^y EjWnWl. (35) 



Q(m) 



feO(F*) 



feO(F*) ^ v 7 f,f'eO(F*) 



Now, if / and /' are disjoint then \Yf(m) r\Yf/(m)\ ^ 2, so suppose that e = {«,!>} closes 
both / = {v,w} and /' = {v,z}; then {w,u>} and {u,z} must both be edges of G m , and 
so (assuming the event Z(m) holds) there are at most (logn) 2 such edges e. Combined 
with (35), this implies that the expected number of copies of F destroyed by e is 



E ( E Y f (m)±o(Fnio g n)A, 

Vl ' F'eNJF) \feO(F*) J 



(36) 

*eiV (F) x /eO(F*) 

as required. □ 



In the proof of Lemma 4.18, and several times later in the paper, we shall need to use the 
Product Rule, which we state here for later reference. 

Lemma 4.22 (The Product Rule). For any random variables a{m) and b{m), 

E[A (a(m)b(m))] = a(m)E[Ab(m)] + b(m)E[Aa(m)] + E [(Aa(m)) (Ab(m))] . (37) 
In particular, if a{m) is deterministic, then 

E[A(a(m)b(m))] = a(m)E[Ab(m)] + Aa{m) (b(m) + E[A6(m)]) . (38) 

Proof Simply expand the right-hand side, and use linearity of expectation. □ 
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We remark that for all of the random variables A(m) which we shall need to consider, the 
single step change AA(m) will be much smaller than A(m), and hence the final term in (37) 
will be negligible. In order to bound |AA^(F)(m)| and E[|Ai\^(F)(m)|], we shall also need 
the following related inequality. 

Lemma 4.23. Let A{m) be a random variable, let A(m) and g(i) be functions, and set 

A*(m) = A(m) T i(m) . 

g(t)A(m) 

If\A(m)\ ^ (l + g(t))A(m), 

\AA(m)\ « l ^-A{m) and \A(g(t)A(m))\ « • g(t)A(m), (39) 

then 

\AA*(m)\ < 2- ( + 



( \AA(m)\ l+g{t) logn \ 
\g(t)A(m) g(t) " ' J 



We postpone the (straightforward) proof to the Appendix, and remark that the condi- 
tion (39) is satisfied by the functions N A (F) and g F:A (t). We shall also use the following 
easy observation. 

Observation 4.24. // (F, A) is a graph structure pair and F° G T F , then tA(F) < t A (F°). 

Proof. This follows easily from the definitions, using Observations 4.16 and 4.20. See the 
Appendix for the details. □ 

Finally, we need the following relations between different error terms. 

Observation 4.25. Let (F,A) be a graph structure pair, and let F° e F° F . Then 

°( F )g y (t) < and gF-A 1 ) < OfA*) 

as n — > oo. 

Proof These also both follow easily from the definitions, using Observation 4.24. We again 
postpone the details to the Appendix. □ 

We are now ready to prove Lemma 4.18. In the proof below, in order to simplify the 
calculations we shall use the symbol ~ to indicate equality up to a multiplicative factor of 
at most 1 ± 0(1 /n). We emphasize that this error term will never play an important role. 

Proof of Lemma \.18. By Lemma 4.21, and since Z(m) holds, we have 

E[A W ] e - ' £ £ Y l(m) + £ ± 

L VK /J Q(m) ^ ^ 1 ^ Q(m) Q(m) 

Moreover, differentiating (22), we obtain 

n 1 \ 1 J Q(m) pfrto Q(m) 
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since |gy ~ J72, \F°f\ = e(F) and tn 3 / 2 ■ N A (F°) = N A (F) ■ e~ 4 'V/2 w N A (F) ■ Q(m). 
Subtracting, and using the event y(m) and our assumption that o(F) <C n°^\ we obtain 

E[A W ] - AN 4F) e-(l± 9y (t)) ■ 0(F) ■ ^™ + 0(F) ■ ^™ 

Note that we have u < t < i A (F) «S *a(-F°) for each F° e F F , by Observation 4.24. Thus, 
using (33), the event Q(m) and the fact that g q (t) <C g y (t) to bound the first term, and the 
event £(m) D Q(m) to bound the third and fourth terms, it follows that the right-hand side 
of (40) is contained in 



(l - (1 ± 2g v (t)) (1 + 9F Am(F))) ■ o(F) ■ N ^2' YM 

Q(m) 



± 

Q( 



j_ ^ / 1± ( t) _ x ^(1 + ^))^ 



Now, dividing both sides by g FjA (t)N A (F), and using Observation 4.25, we obtain 31 
E[AiV,(F)]-AiV. 4 (F) 8t-o(F) e(FU 

s,..wjv x( n £ -^7^'W ± ^( i+ — J- < 41 > 

The proof is almost complete; all that remains is a little simple analysis. Indeed, since 
N^F) - N A (F) = g FiA (t)N A (F) ■ N*(F), the Product Rule (Lemma 4.22) gives 

g F ,A\ t ) N A(F) g F>A (t)N A (F) V / 

and since g F , A (t)N A (F) is equal to (2t) e ( F M c< - F ' j4 - )_4o ( i? * ) - ) * 2 times some function of n, we have 

A(g F , A (t)N A (F)) » -L ^ + 2t(c(F,A) -4o(F))) • ^(^(F). (43) 

Combining the last three displayed lines, and observing that the terms involving St ■ o(F) 
in (41) and (43) cancel one another 32 , we obtain 



31 Here we again use the fact that ~ -§72' I^fI = e ( F ) and f « 3/2 ' N A (F°) w 7V A (F) • Q(m). To 

bound the final term, recall that c(F, A) ^ 2, and so g F ^A(t)Q(m) ^ n 7 / 4 e~ 2 ' 2 ^ n 3 / 2 + e for every t < i*. 
32 Note also that, by (43), the final term in (42) is swallowed by the error term. 
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Noting again that c(F, A) > 2 for every pair (F, A), it follows that 

as required. □ 

We finish the subsection by deducing the following easy consequence of the observations 
above, which will be necessary in the martingale argument to follow. 

Lemma 4.26. Let (F, A, <fi) be a graph structure triple, let u> < t ^ t A {F), and suppose that 
<j) is faithful at time t. If E(m) fl y{m) fl Z{m) fl Q(m) holds, then 

E[\AN;(F)(m)\] < C - l ° gn f 1 + ^Aty 



™ 3/2 \ 9F , A (t) 

Proof. Observe first that E[|AiV^(F)(m)|] is at most the expected number of copies of F 
rooted at 4>{A) created in step m + 1 of the triangle-free process, plus the expected number 
of copies destroyed. By (34), and since 8(m) D Q(m) holds and cu < t ^ t A (F) ^ t A (F°), 
the expected number of copies created is 

where the second inequality follows using Observations 4.20 and 4.25. 

Similarly, by (36) and the event £{m) fl y(m) fl Z{m) fl Q(m), the expected number of 
copies destroyed in step m + 1 is at most 

oL) E ( E Y f ( m ) + o(Fn\ognA ^2-(l + g F Xt))-o(F).^ 2 .N A (F), 

since g q (t) < < 1 and o(F) 2 (logn) 2 < n o(1) < g y {t)Y(m) for all t ^ t*. Thus 
E[|AJV,(F)(m)|] < 16t ■ (l + g F , A (t)) + o(F)) ^^ 2 M ■ 
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Now, by Lemma 4.23, we have' 

L * J \g F At)N A (F)(m) 9fA*) » 3/2 

and hence it follows that 



n 3/2 V 9f,a(«) 

as required. □ 



33 Thc conditions in (39) follow from (43), the event £{m) and the fact that e(F) + o(F) + c(F,A) < 
(logn) 1 / 4 , which holds by Observation 4.16. 
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4.3. Creating and destroying copies of F. In order to apply our martingale technique 
to the self-correcting variables N^F), we shall also need to bound the maximum possible 
step size of each of these variables, under the assumption that all of the other variables are 
still tracking. We shall do so by showing that each copy of F rooted at 4>(A) which is created 
or destroyed in step m+1 corresponds to another graph structure F' in G m . We shall thus 
be able to bound the number of such copies of F using the event S(m). 

Let us first consider the number of copies of F rooted at <fi{A) which can be created by 
the addition of a single edge e. Note that this is exactly the number of copies (in G m ) of 
graphs in F F whose F- vulnerable edge is e, and which are rooted at 4>{A). This observation 
suggests the following definition. 

Definition 4.27. Given a graph structure F and an independent set A C V(F), define 
the family Fp A to be the collection of (labelled) pairs (F + ,A + ) obtained by absorbing the 
endpoints of an edge of F into A (to form A + ), and removing the edges inside A + . 

Note that there are at most e(F) pairs {F + , A + ) in Tp A . The following lemma motivates 
the definition above. 

Lemma 4.28. Let (F,A,(f>) be a graph structure triple. The number of copies of F rooted 
at 4>(A) created by the addition of a single edge to G m is at most 



where the maximum is over faithful maps + : A + — > V{G m ). 

Proof. Let e G E(K n ) be the edge added in step m + 1, and consider the family of copies of 
F rooted at <p{A) which are created by the addition of e. We partition this family according 
to the endpoints of e in V(F) \ A, i.e., according to which edge of F was added last. 

Now, each part of this partition corresponds 34 to a pair (F + , A + ) G Fp A ; ^ us consider 
one such pair. The number of copies of F in the corresponding part is exactly iV^+(.F + ), 
where <p + : A + — > V(G m ) satisfies <P + \a = 4> and maps the extra vertex (or vertices) of A + to 
the endpoint(s) of e. The lemma follows immediately. □ 

We next turn to destroying copies of F. Given a pair (F,A), we would like to define 
a family Tp A of pairs (F~,A~) in such a way that the appearance of such a pair in G m 
corresponds to the destruction, by a given edge e G E(K n ), of a copy of F rooted at some 
given 4>{A). After some thought, this leads to the following, somewhat convoluted definition. 



Note that if two edges share an endpoint and have the other endpoint in A then we obtain the same 
pair (F + ,A + ). However, in that case one of the two parts corresponding to (F + ,A + ) is empty. 





(F+,A+)eT+ A 
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Definition 4.29. T FA consists of all graph structure pairs (F~, A~) which are obtained as 
follows: 35 

(a) Set A~ = A and F~ = F U {e}, where e ^ E(F) U 0(F) is an edge from a vertex of 
A to a vertex v G V(F) \ A, where v has an open F-neighbour in A. 

(b) Set A~ = A U {v } for some v G V(F) \ A, and let F~ = F A be obtained from F by 
removing the edges inside A~ . 

(c) Set A~ = A U {v} for some v g V(F), and let F~ = (F + v) U {e} be obtained by 
adding to F the vertex v and an edge e from v to some vertex of V(F) \ A. 

(d) Set A" = A U {w, f } for some u,v E V(F) \ A, and let F~ = F A be obtained from 
F by removing the edges inside A~ . 

(e) Set A- = A U {u, for some u G V(F) \ A and -u ^ V(F), and let F" = (F + w) A 
be obtained from F by adding the vertex v and removing the edges inside A~ . 

(/) Set ^ = iU {w, for some u G V(F) \ A and v ^ V(F), and let the structure 
F~ = (F A + v ) U {e} be obtained from F by adding the vertex v, adding an edge 
e from v to some vertex of V(F) \ A" , and removing the edges inside A~ . 

The following table will be useful in the calculations below. 





(a) 


(b) 


(c) 


(d) 


(e) 


(/) 


v A -(F-)-v A (F) 





-1 





-2 


-1 


-1 


e(F-) - e(F) 


1 


^ 


1 


^ 


^ 


s$ 1 


o(F-) - o(F) 





^ 





«S 


^ 


^ o 



Table 4.1 



Note in particular that 

va'(F') ^ va(F), 



o(F') ^ o(F) and e(F') ^ e(F) + 1 (44) 
for every (F f , A') G Tp A . The next lemma motivates the definition above. 

Lemma 4.30. Let (F, A, <fi) be a graph structure triple, and suppose that <fi is faithful in 
G m U {e}. Then the number of copies (in G rn ) of F rooted at <fi(A) destroyed by the addition 
of the edge e to G m is at most 

> max Nj,-(F~)(m). 

{F-,A-)eF~ A 

where the maximum is over faithful maps $r : A~ — > V(G m ). 

Proof. Let e = {u,v} be the edge added in step m + 1, and suppose that e destroys a copy 
F* of F, rooted at (p(A), in G rn . Note that this implies that either e G 0(F*) or e closes an 



35 We write F + v and F U {e} for the graph structures with (vertex, edge, open edge) sets (V(.F) U 
{v}, E(F), O(F)) and (V(F), E(F) U {e}, 0(F) \ {e}) respectively. 
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open edge of F*. We claim that there is a graph structure H C G m [V(F*) U {u, v}] such 
that 

H G Ntj,- (F~) for some (F~, A~) G Fp A and <jT : A~ ->■ V(G m ) (45) 

with <P~\a = and lm(0~) = <fi{A) U {-a, t>}. There are various cases to consider. 

Suppose first that e C <p(A); we claim that (45) holds with (F~,A~) as in case (a) of 
Definition 4.29. Indeed, e closes an open edge / G 0(F*), and so this open edge must have 
one endpoint (u, say) inside A and the other w outside. Since adding e closes /, it follows 
that {v,w} must be an edge of G m . Hence there must exist a copy of F~ in G m on the 
same vertex set, where F~ is obtained from F by adding the edge (h, say) corresponding to 
{v,w}. Clearly h £ 0(F), since {v,w} G E{G rn ), and moreover h G" F(F), since otherwise 
<j) would not be faithful in G m U {e}. 

Suppose next that e D <f>(A) = {u}. If v G V(F*) then it is immediate that (45) holds 
with (F~ , A~) as in case (b), so suppose not. Then adding e must close an open edge {u,w} 
of F*. It follows that {v, w} must be an edge of G m , and hence (45) holds with (F~, A~ ) as 
in case (c). 

Finally, suppose that e n <p{A) = 0. If {u,v} C V(F*) then (45) holds with (F~, A - ) as 
in case (rf). On the other hand, if {u,v} D V(F*) = then e cannot destroy F*. Hence we 
may assume that u G V(F*) and t> ^ V(F*), and that the addition of e closes an open edge 
{u, w} in F*. It follows that the edge {v, w} is an edge of G rn , and hence if w G A then (45) 
holds with (F~,A~) as in case (e), and if w £ A then (45) holds with (F~,A~) as in case 
(/) of Definition 4.29, as required. □ 

Remark 4.31. The observant reader will have noticed that the graph structures F + and 
F~ may have isolated vertices. However, using Remark 4.2, the event S(m) implies bounds 
on AV(F') for every pair (F', A') G F% A U Fp A , and every faithful ft : A' ->■ 7(G m ). 

We finish this section by proving some straightforward lemmas and observations which 
will be useful in later sections. 

Observation 4.32. Fp A C Tp A for every graph structure pair (F,A). 

Proof. This follows immediately from the definitions, since if (F', A') G Tp A then we are in 
either case (b) or (d) of Definition 4.29. □ 

Observation 4.33. Let (F, A) be a graph structure pair, and let (F', A') G 

(a) If A' C\F — A then N A ,(F>)(m) = ^ • iV A (F)(m). 
(6) // A' C H' C F' then H' D F ^ A. 

Proof. Both statements follow easily from Definition 4.29. Indeed, ii A' (1 F — A then we 
must be in either case (a) or (c) of that definition, and in both cases we have v A (F) = 
v A '{F'), o(F) = o(F') and e(F) = e(F') — 1. For the second statement, simply note that 
va'(F') = v A (F), i.e., every new vertex of F' is included in A'. □ 

Observation 4.34. Let (F,A) be a graph structure pair, and let (F',A') G Fp A . For every 
A' C H' C F' and every m G [to*], we nave N H '{F'){m) < N H r nF (F)(m). 
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Proof. This also follows easily from Definition 4.29. Indeed, setting H — H' n F, we have 
vh'(F') — v H (F), e(F') - e(H') ^ e(F) - e(H) and o(F') - o(H') = o(F) - o(H) 
in each of the cases (a)-(f). □ 

We make one more simple observation, which will play a crucial role in Section 4.4. Let 
T FA C Fp A denote the graph structure pairs in Tp A with v A >(F') < v A (F). 

Observation 4.35. Let (F, A, <p) be a graph structure triple, and suppose that the edge e 
which is added in step m + 1 of the triangle-free process is disjoint from <p(A). Then the 
number of copies of F rooted at 4>(A) which are destroyed by e is at most 

V max N#(F , )(m), (46) 

(F',A')e^ A v 

where the maximum is over faithful maps </)' : A' — > V(G m ). 

Moreover, we have Fp A C F* F A , and so the number of copies of F rooted at <f>(A) created 
in step m + 1 of the triangle-free process is also bounded above by (46). 

Proof. Proceed as in the proof of Lemma 4.30, noting that if e is disjoint from <f)(A) then 
the pair (F~ , A~) in (45) was obtained via either case (d), (e) or (/) of Definition 4.29, and 
that in each of these cases we have v A -(F~) < va(F), and so (F - , A~) G Fp A , as claimed. 
For the second part, simply note that if (F', A') G J~p A then we are in either case (b) or (d) 
of Definition 4.29. □ 

We next introduce a piece of notation which will be extremely useful in Sections 4.4 
and 4.5, below. Given a graph structure pair (F, A), set 

8(F, A) = C 3 v A {F) 2 + 2e(F) + o{F) 

and recall that A(F, A) =5(F,A) C . We shall write 

A(F-v,A):={5(F,A)-C) C . (47) 

Note that this is an upper bound on the value of A(F, A) one obtains by decreasing va(F) 
and increasing e(F) by one. We shall need the following properties of A(F — v, A). 36 

Lemma 4.36. Let (F,A) be a graph structure pair, and let (F',A r ) G F F A . If A' n F ^ A, 
then 

g F ,, A ,{t) < (logn) A ( F -^ 

for every < t ^ t A i (F 1 ) . 

Proof. The condition A' D F ^ A implies that we are in neither case (a) nor case (c) of 
Definition 4.29. Since v A /(F') < v A (F) in each of the other cases, the result follows easily 
from the fact that g F >,A>(t) ^ (\ogn)^ F '> A '^ for every < t < t A >(F'). □ 



36 See the Appendix for more detailed proofs. 
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Observation 4.37. Let (F, A) be a graph structure pair, and let (F',A!) G Tp A . Then 

A(F',H',A') ^ A(F-v,A) ^ A(F, A) - 3 y/A(F, A) 
for every A' C H' C F' . Moreover, the same bounds holds if H' = F' and A' D F ^ A. 

Proof. Note that if A' ^ H' ^ F', then 1 ^ v A >{H') = v A >(F') - v w {F') ^ v A ,(F') - 1. Both 
inequalities now follow easily from (44), using the convexity of the function x h- >■ x c . □ 

The next observation also follows by the same argument. 
Observation 4.38. Let (F, A) be a graph structure pair, and let (F',A') G Tp A . Then 

A(F' - v, A') ^ A(F, A) - 3 a/A(F, A). 
Observation 4.39. Let (F,A) be a graph structure pair, and let (F',A') G J~p A - Then 
A(F',A') ^ (l + e)A(F,A) ^ (1 + 2e)A(F - v, A). 

Proof. Note that A(F',A') <: (8(F,A) + 2)°, by (44), and recall that A(F - v,A) = 
c 

(S(F, A) -C) , by definition. Since S(F, A) > C 3 , the claimed bounds follow. □ 
Finally, we give three bounds which rely on one of the following assumptions 37 : either 

(l0gn)^ F <^ < n VA(F)+e(F)+l (4g) 

and t A (F) > 0, or (logn) A ( F -^) ^ n VA ^ F \ 

Lemma 4.40. Let (F,A) be a graph structure pair. If (\ogn) A ( F ~ v ' A ^ ^ n VA ^ F \ then 

e logn 



max |A(F, A), A(F', A') j - A(F — v, A) 



va(F) log logn 
for every (F',A') G Tp A . 

Proof. It follows from the definitions and (44) that 

max |A(F, A), A(F', A') j - A(F - v,A) ^ (S(F, A) + 2)° - (S(F, A) - C)° 

^2C-6(F,A) - 5{ ^ A) <:-■ Va{f)2 , 
since 8(F, A) ^ C 3 v A (F) 2 and by Observation 4.39. Hence, if (logn) A ( F -^ ^ n VA ^ F \ then 



max |A(F, A), A(F', A') j - A(F - v, A) 



4 logn e logn 

^ — < 



C ■ v a(F) loglogn v A (F) loglogn' 
as claimed. □ 

The proof of the next lemma is almost identical, see the Appendix for the details. 



37 We remark that when either of these inequalities is reversed, the conclusion of Theorem 4.1 (in the 
corresponding case) will hold trivially, see below. 
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Lemma 4.41. Let (F,A) be a graph structure pair with tA(F) > 0, and let (F',A') G T FA . 

If (\0g n)^ F ^ < n VA{F)+e{F)+l } fhen 



max { A(F', A') - A(F, A), x/A(F^4)} 



£ log n 



v A (F) log log n 
Finally, the following bound follows easily from Lemma 4.41. 
Lemma 4.42. Let (F,A) be a graph structure pair with t A (F) > 0, and let (F',A') e Tp A . 

If (\0g ^ n VA(F)+e(F)+l } then 

9F',A'(t) < n 1 ' 4 ** ■ g FiA (t) 

for every < t < t A > {F') . 

Proof. Simply note that, by Lemma 4.41, 

e logn 



-y(F', A') — 7(F, A) <: A(F', A') — A(F, A) + e(F) < 



va(F) log logn 
Since t ^ tA'(F') and i>a(.F) ^ 1, it follows that 

g F ,, A '(t) < (logn)^'> < n 1 ' 4 *-^*), 
as required. □ 

4.4. Balanced non-tracking graph structures. In this subsection we shall use the tools 
developed above to prove a slight strengthening of the bound in Theorem 4.1(c) for balanced 
pairs (F,A), i.e., pairs whose building sequence is either 

ACH = F or A = H C H l = F, 

see Definition 4.8. We shall prove the following proposition. 

Proposition 4.43. Let (F, A) be a balanced graph structure pair, and let tA(F) < t ^ t*. 
With probability at least 1 — n ~ 3logn , either (£(m) fl Z(m) fl <2(m)) c holds, or 

N ( p(F)(m) ^ max 

for every <p: A — > V(G m ) which is faithful at time t. 

In the case tA(F) = 0, we shall prove the following slightly stronger bound. 

Proposition 4.44. Let (F,A) be a balanced graph structure pair with tA(F) = 0, and let 
< f < t*. With probability at least 1 - n - 3logn , either (£(m) n Z(m) n Q(m)) c holds, or 

N^(F)(m) s= (logn) A(F -^ } (50) 
for every A — > V(G m ) which is faithful at time t. 
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Let us begin by sketching the proof of Propositions 4.43 and 4.44, and discussing why 
they are necessary for the proof of Theorem 4.1. When £a(-F) = we shall use induction on 
va(F), combined with Bohman's martingale method (i.e., Lemma 3.2). In fact, when va(F) 
is bounded we shall prove a much stronger bound, see Proposition 4.46, below. 

The harder case is when t A (F) > 0. First observe that, since t A {F) £ {0,t*}, it follows 
that o(F) > 0, and therefore we need to show that N^F) is decreasing for a while after time 
t = tA(F). We would like to use our usual martingale method (i.e., Lemma 3.1), but there is 
a problem: the single step changes in N^(F) can be very large. In order to get around this 
problem, we define a new variable (see (59), below), which counts the number of copies of F 
rooted at (f>(A) which are destroyed by the addition of an edge which is disjoint from <j){A). 
Using Observation 4.35, we will be able to give a sufficiently strong upper bound on the 
single-step change in this variable; moreover, we shall be able to bound its expected change, 
using the trivial observation that every open edge of F has at most one endpoint in A. 

Finally, we remark that the bounds (49) and (50) will be used to bound N<j,i{F')(m) for 
pairs (F',A!) £ Fp A which are balanced, when tA'(F') < t ^ t A (F) and gF,A(t) ^ 1- The 
following definition will allow us to assume that this bound holds when we need it. 

Definition 4.45. For each ml £ [m*], let JA(m') denote the event that the bound (49) holds 
for every balanced graph structure pair (F, A), every tA(F) ■ n 3//2 < to ^ ml and every map 
0: A — > V(G m ) which is faithful at time t, and that moreover (50) holds if t A (F) = 0. 

As noted above, the event A4(m) will be a crucial tool in Section 4.5, where we shall give 
an upper bound on \AN1(F)(m)\ in the case u < t ^ tA(F). Observe that (49) and (50) 
both hold trivially if (\ogn) A ^ F ~ v ' A ^ ^ n VA ^ F \ so we may assume otherwise. 

We begin by showing that when t A (F) = and v A (F) is bounded, we can obtain a much 
sharper result, which (almost) generalizes Proposition 3.7. 

Proposition 4.46. Let (F, A) be a balanced graph structure pair, and suppose that v(F) ^ ou 
and t A (F) = 0. Then, with probability at least 1 — n ~ 3logn , for every m £ [to*] either 
(£(m — 1) fl Q(to — l)) c holds, or 

N^F)(m) ^ (logn) 3 ^ (F) (51) 

for every </>: A — >■ V{G m ) which is faithful at time t. 

Let's begin by deducing Proposition 4.46 from Lemma 3.2 and Proposition 3.7. 

Proof of Proposition 4-46- We shall use induction on va(F) to prove that the event in the 
statement holds with probability at least 

l_ n v A {F) , n -i\ogn 

which implies the claimed bound 
since va(F) < u <^ logn. This holds for va(F) = 1 by Proposition 3.7, so suppose that 
va(F) ^ 2, and let us assume that the induction hypothesis holds for all smaller values of 
va{F). Recall from Definition 4.8 that the conditions that (F, A) is balanced and t A {F) = 
imply (and in fact are equivalent to) 

e(H) - 2v A (H) < e(F) - 2v A (F) for every A C H C F, (52) 

and note in particular (setting H — A) that t* A (F) = 0. 
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We claim first that, without loss of generality, we have e(F) = 2va(F) and o(F) = 0. 
Indeed, if o(F) > then we can simply remove all open edges from F; in doing so we only 
increase N^F), and we retain the condition (52). If e(F) > 2va(F) then there are two cases: 
either there exists a substructure A C H C F with 

e(H)-2v A (H) = e(F)-2v A (F), 

or there does not. In the former case, observe that (52) holds for the pairs (F, H) and (H, A). 
It follows by the induction hypothesis and Lemma 4.10 that 

NJF) ^ NJH)- max ify(F) < (l ogn fMH)+v H (F)) = (i ogn ^v A (F)^ 

<f>>: H-^V(G m ) 

with probability at least 1 — (n VA ^ + n VH ^)n~ 4logn : as required. In the latter case, i.e., no 
such H exists, then we may remove an arbitrary edge from F. Note that N$(F') ^ N</,(F) 
for the resulting graph structure pair (F', A), and moreover that (52) holds for (F', A), since 
e(F) > 2va(F). Hence (F',A) is balanced and t^F') = 0, as required. 

We are left to deal with the case e(F) = 2va(F) and o(F) = 0. If there exists a sub- 
structure A C H C F with e(H) = 2va{H) then we are easily done, exactly as above, by 
applying the induction hypothesis to the pairs (F, H) and (H, A). So assume not, and recall 
from (34) that, 

WW] = £ w- 

We claim that N A (F°) = (2t) e ( F )- 1 e" 4 ' 2 y/n for every F° G T° F , and that c(F°, A) = 2. 
Indeed, these statements follow immediately from the assumptions that e(F) = 2va(F) and 
o(F) = 0, and that e(H) < 2va(H) for every A C H C F, respectively. Thus, while the 
event £{m) fl Q(m) holds, we have 38 



N^F°)(m) e(F) ■ (logn)^- 1 )/^- 4 '^ (logn)^ F > 



where we used the bounds 2t < \/\ogn and e(F) = 2va(F) ^ 2u>. 

Now, set £ = va(F) and, for each to' G [m*], let T^(to') denote the event that (51) holds 
for every balanced graph structure pair (F',A') with tA(F) = 0, va'(F') < £ and v(F) ^ u, 
every to ^ vnl and every <fi: A — > V(G m ) which is faithful at time t. We shall bound, for 
each to G [m*], the probability of the event C^(m), defined as follows: 

£j(m) := %-l)nQ(ffl-l)nK,(m-l)n (7V (F)(to) > {\ogn) 3vA(F) ) . 

Fix m G [ra*]; we shall bound the probability of C F (m ). Subtracting the right-hand side 
of (53) from AN^(F)(m) and summing over to, we obtain a function 

to • (\ogn) VA(F) 



MUm) = N^F)(m) - 



n 



3/2 



38 Note that we must consider separately the cases t < to and u < t < t*, and observe that l t ^ w • fF°,A(t) + 
h>u ■ 9Fo, A (t) ^ n- £ ■ (logn)T( F - A ) < 1 since v(F) ^ uj and c(F°, A) = 2. 
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defined on [mo], which is super-martingale while £{m) fl Q{m) holds. Our plan is to apply 
Lemma 3.2 to M F ; in order to do so, we first claim that if 1Zi>(m) holds then 

^ AN^(F)(m) ^ e(F) ■ (\ogn) 3vA{F) - 3 . (54) 

The lower bound is trivial, since o(F) =0. To prove the upper bound, recall from the 
previous section the definition of Tp A , and Lemma 4.28. We claim that if 7^ (to) holds then 

iV>(F+)(m) < (logn) 3 ^- 3 (55) 

for each (F + ,A + ) G Tp A , which clearly implies (54). 

To prove (55), simply note that e(A + ) < 2va{A + ), by (52) and since e(F) = 2va(F), and 
that F + = F A+ . It follows that (52) holds for the pair (F+,A+), and hence (F+,A+) is 
balanced and t A +(F + ) = 0. The bound (55) now follows from the event TZ^m), and the fact 
that A + 7^ A, as claimed. 

Set a = e(F) • (logn) 3 ^^" 3 and (3 = (}ogn) 3vA ^ /m ^ (\ogn) VA 

(F) / n 3 / 2 ^ anc i observe 

that 

a-P-mo = e{F) ■ (log nf . 
Writing /C(m) = £ (to) D Q(to) fl TZe(m) and applying Lemma 3.2, it follows that 

F(£j (mo) ) < P ((m/M > 02^^12) nK(m „ _ !)) ^ exp (-|^) . 

Summing over choices for too, and adding this to the probability that 7^ (to*) fails to hold, 
it follows (using the induction hypothesis) that the probability that 

£{m - 1) n Q(m - 1) n (A^(F)(to) > (logn) 3 ^ (F) ) 

holds for some to G [to*] is at most 



n ^41ogn 



£'=1 (F',A') 

Summing over the (at most most y/n) choices of 0, we obtain the claimed bound on the 
probability of the event in the statement. □ 

Using almost the same proof, we obtain Proposition 4.44. 

Proof of Proposition 4-44- We use induction on va{F) to prove that if (F,A) is balanced 
and tA(F) = 0, then with probability at least 1 — n VA ^ ■ n _41ogra , for each to G [to*] either 
[£{m — 1) fl Q(to — 1)) holds, or 

A^(F)(m') < (\ogn)^ F - v ^ (56) 
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for every 0: A — > V(G m ) which is faithful at time t. The proof is almost identical 39 to that 
of Proposition 4.46, the main difference being that the bound (53) becomes 

^ N+(F°)(m) ( {\ogn)^ A ) \ (logn)^* 1 ) 
F h, OH H + ) n 3 / 2 ' (57) 

if £{m) fl Q(m) holds, using the fact that (F°,A) is balanced, and noting that 

W ' f F o, A (t) + ' 9Fo, A (t) < n~ e ■ (logn)^ A) , 
which follows since c(F°, A) = 2. Note also that the bounds in (54) become 

< AA^(F)(m) < (log™) A(F+ ~"' A+) < (logn) A ( F -^ A )- 3 , 

since F + = F A+ and A + ^ A. 

Now, let lZt(m') denote the event that (50) holds for every balanced graph structure pair 
(F', A') with t A (F) = and v A >(F') < t = v A (F), every m < m! and every 0: A ->■ l / (G m ) 
which is faithful at time t. Define 

£j(m) := £(m - 1) D Q(m - 1) D 7^(m - 1) n (i\^(F)(m) > (logn)^-^) 

and fix mo G [m*]; we shall bound the probability of the event C F (mo). Indeed, subtracting 
the right-hand side of (57) from AN,p(F)(m) and summing over m, we obtain a function 



3/2 

defined on [m ], which is super-martingale while £(m)fl<2(m) holds. Let a = (logn) A ( i? ~ 1, ' j4 ^ 3 
and 

(logn)^"^ (logn) A ( F "^)- 1 / (logn)^ F ' A )^ (logn)^( F ) 



m n 3 / 2 \ n 6 J n 3 / 2 

where the final inequality holds since (\ogn) A ^ F ' A ^ ^ (logn) A ( i? ~ 1, ' A ) • n £ / 2 , by Lemma 4.40, 
since (logn) A( - F ~ v ' A ) ^ n^ 1 ^, by assumption. 

Writing /C(m) = £(m) fl <2(m) fl TZ((m) and applying Lemma 3.2, it follows that 

P(£j(m )) < P^Mf(mo) > i 1 ^!^^ n /C(m - 1)) < n"^". 

The claimed bound now follows by the induction hypothesis, exactly as before. □ 

We now turn to the more substantive part of this subsection: the proof of Proposition 4.43 
when t A (F) > 0. Note that in this case we have o(F) > 0, since < t A (F) < t* . We begin 
by noting that at time t = t A (F), the bound on N < p(F)(m) given by Theorem 4.1(6) implies 
that required in (c). 



39 Note that our applications of the induction hypothesis still work in exactly the same way using the 
convexity of the function x ^ x c . 
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Lemma 4.47. Let (F,A) be a balanced graph structure pair with < t A (F) < t* , and set 
t = tA(F). For each injection <fi: A — > V{G m ), if 

N^F)(m) e (l±g F , A (t))N A (F)(m), 

then 

N^(F)(m) < (logn) A(F ' A)_1 . (58) 
Proof. We claim that, at time t = t A (F), 

(l + g FA (t))N A (F)(m) = (l + (logn)^) • (2tf F) < ( logn) A(F,A)_1 , 

from which the lemma follows immediately. The equality follows from the definitions (10) 
and (26) of t* A {F) and gF,A(t) respectively, the fact that (F, A) is balanced, which implies 
that t* A (F) = t A (F), and the fact that e ct = n 1 ^ when t = t A (F), since we are assuming 
that t A (F) < t*. The inequality follows from the definition (24) of j(F,A), i.e., 

1 (F,A) = A(F,A)-e(F)-2, 

and the fact that 2t < y/\ogn, since t < t*. □ 

In order to see that (58) implies the bound in Theorem 4.1(c), simply note that the building 
sequence for (F, A) is 

A — H CH 1 — F 

since (F,A) is balanced and t A (F) > 0. Thus t\ = t A (F) (since t A (F) < t*), and so, by 
Lemma 4.6, the minimal A C H C F with t < tn(F) is equal to F for every t ^ t A (F). 

We are left with the task of showing that N^F) continues to decrease for some time after 
t A (F), and moreover does not increase again too much later on; we shall do so using a slight 
variant of the martingale method of Section 3. In particular, we shall work with the original 
(un-renormalised) random variables, and we will prove only an upper bound on N^(F). 

More precisely, let us define for each balanced pair (F, A) a corresponding Line of Death: 

L D A {F){m) = max{e-°( F )( t2 -^( F ) 2 )(logn) A ( F ' A ),(logn) A ( F -^}, 

and a Line of Peril, L^(F)(m) = § ■ L A (F)(m). For each t A (F) ■ n 3/2 < m < to*, let 

£J (m) = {L p A (F)(m) < N,(F)(m) < L D A {F){m)} 

denote the event that A^(F)(to) lies between these two lines, and for each pair (r, s) G N 2 
with t A (F) ■ n 3 / 2 ^r^r + s^m*, define 



r+s— 1 

Cfrs) = {N,(F)(r) < L p A (F)(r)} n f| C F (m) n {N*(F)(r + s) > L D A {F){r + s)}, 

m=r+l 



so £ F (r, s) holds if A^(F)(to) crosses the Line of Death in step r + s, and crossed the Line 
of Peril for the last time (before step r + s) in step r + 1. It follows immediately from the 
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definition and Lemma 4.47 that if £(m) holds but (49) does not, for some < t ^ t*, 

then the event C?(r, s) holds for some pair (r, s). Given such a pair (r, s), set 40 



so = mm < s, 



n 3 



o{F) ■ r J ' 

and observe 41 that L A (F)(r) < C ■ L®(F)(r + s ). We define a random variable 42 as follows: 

m'—l 

«"0 = E (^("O - D,( F )(m) + \AL°(F)(m)\ + ^ • L^(F)(m)) (59) 

for each r ^ m' ^ r + Sq, where 

• C^{F){m) denotes the number of copies of F rooted at <fi(A) which are created in 
step m + 1 of the triangle-free process, and 

• D ( f ) {F){m) denotes the number of copies of F rooted at 4>{A) which are destroyed in 
step m + 1, if the edge e m+ i 43 is disjoint from (f)(A), and D^(F)(m) = otherwise. 

The motivation for this slightly convoluted definition is given by the following lemma, which 
follows easily from the fact (see Observation 4.35) that if e m +\ is disjoint from <p(A), then 
the single-step change \ AN < p(F)(m)\ cannot be too large. 

Lemma 4.48. Let (F, A) be a balanced graph structure pair with tA(F) > 0, let 

t A (F) ■ n 3/2 < r < m < r + s < m*, 
and suppose that <fi: A — > V(G m ) is faithful at time t. If £{m) holds, then 

\AM%(m)\ ^ (\ogn) A ^ F - v ^- c . 

Proof. Observe first that 

\AL°(F)(m)\ < (2o(F)+e) * . L°(F<)(m) < O^")^^ , (60 ) 

by Lemma 4.40. Thus it will suffice to bound C <t> (F)(m) + b <t> {F)(m). Recall that F* FA 
denotes the graph structure pairs in Fp A with va'(F') < v A (F). We claim first that 



C^F)(m) + D^F)(m) < £ max N^(F')(m) (61) 

(F>,A>)er FiA ; A -* V{Gm) 

where the maximum is taken over faithful maps </)' : A' — > V(G m ). Indeed, this follows 
immediately by Observation 4.35 and the definition of D l f ) (F)(m). 
We will show that 

AV(F')(m) < (logn) A(F '' A,) < (logn) A(F -^ A) - 2C . (62) 



40 Note that r ^ uj ■ n 3 / 2 , since t A {F) ^ to. 

41 This follows since s < n 3 / 2 < r, which implies that o(F)((r + s ) 2 - r 2 ) = 0(n 3 ). 
42 Note that, to simplify the notation, we suppress the dependence of on the pair (r, s). 
43 That is, the edge which is added to G m in step to + 1 of the triangle-free process. 



THE TRIANGLE-FREE PROCESS AND R(3,fc) 49 

To prove (62), we claim first that t > t H >{F') for every A' C H' C F'. To see this, observe 
that since (F, A) is balanced, by Lemma 4.7 and Observation 4.34 it follows that 

N H ,(F')(m) ^ N H , nF (F)(m) ^ (2t)< F )-< H '^ F ) <; (2tf F '^ H '> 

for every t ^ t A (F), since every edge of E(H') \ E(H) is also in F(F') \ E(F). Therefore 
t ^ tH'{F'), as claimed. Using the event S(m), the definitions (see (47)) of A(F,A) and 
A(F - v, A), and the fact that v A ,(F') < v A (F) for every (F f , A') e T FtA , we obtain (62). 

Finally, recalling that \Tp A \ ^ 5v A (F) 2 ^ (logn) 2 / 5 , the claimed bound now follows 
from (61) and (62). □ 

We shall need two more properties of M F {m). Together with Lemmas 3.1 and 4.48, they 
will easily imply Proposition 4.43. The first connects the event C F (r, s) and the function M F . 
Note that L A (F)(m) is decreasing on t > t A (F). 

Lemma 4.49. IfC F (r,s) holds, then M F (r + s ) > ^™ • L%(F)(r + s ). 
Proof. Suppose first that s = s. Then by (59) we have 

M F (r + s ) > (iV (F)(r + s) - A^(F)(r)) + (L D A {F){r) - L D A {F){r + s)) 

> {L D A {F){r) - N,{F){r)) > \-L°(F)(r), 

since the inequalities A^(F)(r + s) > L%(F)(r + s) and A^(F)(r) ^ \ -L%{F){r) follow from 
the event C F (r, s). On the other hand, if s — n 3 /(o(F) ■ r), then we have 

(N,(F)(r + s ) - N,(F)(r)) + {L°(F)(r) - L A (F)(r + s )) 

> \-(L%(F)(r)-L°(F)(r + so)) > 0, 
since N <p (F)(r + s ) > § ■ L°(F)(r + s ) and N^F)(r) < | • L°(F)(r), by £j(r, s). Hence 

r+so-l 

M£(r + So) > E ^ 7 7 F T- L ^)(r + S o), 

m=r 

as claimed. □ 

The next lemma implies that M F is a super-martingale on [r, r + so], and gives bounds on 
its expected step-size. 

Lemma 4.50. Let (F, A) be a balanced graph structure pair with t A (F) > 0, let 

t A (F) ■ n 3/2 < m ^ m*, 

and suppose that 0: A — >■ l / (G m ) faithful at time t. If (logn) A ( F ~ v ' A ^> ^ n VA ^ and 
C F (m) n £(m) D Z(m) D Q(m) JioZtfe, £/jen 

E[AM;(m)] *C 

and 

E[\AM F (m)\] < ^^-^(F)(m). 
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In the proof of Lemma 4.50 we shall need the following bound on N ( f l (F°)(m). 

Lemma 4.51. Let (F,A) be a balanced graph structure pair with tA{F) > 0, let 

t A {F) ■ n 3/2 < m < m*, 

and suppose that 0: A — > V(G m ) is faithful at time t. If (logn) A ( F ~ v ' A ^ ^ n v A{F) an d ^ e 
event £{m) holds, then 

N 4 F°)(m) < 

logn 

for every F° G T F . 

Proof. Observe first that, since (F,A) is balanced and t A (F) < t*, we have t* A (F) = t A (F) 
and therefore e Ao{F)tA{F)2 = n VA ^~ e ^ 2 . Similarly, we have t* A (H) > t A (F), and hence 
e 4o(H)t A (F)i ^ n v A (H)-e(H)/2^ for every ACHCF. It follows that 

N A (F)(m) - (2tY {F) e- 4o{F){t2 - tA{F)2) < L A( F )( m ) 

and hence 44 

N A (H°)(m) > N A {H){m) > (2t) e W e - 4o W(* 2 -^(F) 2 ) ^ ( 2 t) e(f °~ e(F) N A (F)(m) (63) 

for every ACHCF, every H° G F° H , and every < t < f . 

Next, recall that 8t 2 • iV A (F°) = Y(m) ■ N A (F), by Observation 4.20, and that t A (F) > to, 
by Observation 4.16. Hence, if t ^ t A (F°) then, since the event £(m) holds, we have 

N+{F°)(m) < (l + 9FO , A (t))NA(F°)(m) = (l + g FO , A (t)) . XMl^M 

l + g F o, A (t) Y(m)-L%(F)(m) < Y(m) ■ L°(F)(m) 
^ (\ogn) A ^ A )-< F ) 8t 2 ^ logn 

as claimed. On the other hand, if t > t A (F°) then let A C H C F° be minimal such that 
t < t H {F), and suppose first that H ^ F°. Then, using (63), we have 

N,(F°)(m) < (logn)^°^)iV,(nM = (logn)*<^> • 

^ Y(m)-(\ogn) A ^ H ^-(2t) e ^- e ^ ^ Y (m) ■ L^(F)(m) 

logn 

since A ^ E ^ F° , and hence L A (F)(m) > (logn) A ( F -^) > (logn) A ( F °'" ff ' >l > +e ( F ) +3 , uging 
the convexity of the function x >->■ x c . Finally, if t > t A (F°) and H = F° then we have 



44 



Here we use Observation 4.20 and the fact that 2te 4 * < y/n for all t ^ t* . 
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since Y(m) > n £ and {\ogn) A<yF ~ v,A) < n VA< ^ F \ by assumption, which implies that 

L D (F)(m) > (W„^-M) > ( l0g ") A(F ' A) > Qog") A(F ° >A) 
L A (^j(mj > (log ra) ^ -jj- 2 > , 

by Lemma 4.40. This completes the proof of the lemma. □ 

We can now prove Lemma 4.50. 

Proof of Lemma 4-50. We begin by recalling (60), and that 

AM F (m) = C^F)(m) - D^F){m) + \AL°(F)(m)\ + ■ L D A {F){m). (64) 

It follows that our task is to bound E[Q(F)(m)] and E[D (F)(m)] in terms of L^(F)(rn). 
In order to do so, we shall essentially repeat the proof of Lemma 4.21, except ignoring copies 
of F which are destroyed by edges which intersect 4>(A). 

Indeed, recall first, from (34), that the expected number of such copies of F created in a 
single step is exactly 



Next, similarly as in (36), and using the event Z(m), we claim that the expected number 
of copies of F rooted at <f){A) which are destroyed by the addition of an edge disjoint from 
4>(A) in step m + 1 is at least 

1 ( fl-e 



E ( E {^)Y{m) - o{Ff{\ognfy 



Q(m) 

To see this, simply note that each open edge / G 0(F) has at most one endpoint in A, 
and hence there are at least (1/2 + o(l))Y(m) open edges which close / and are disjoint 
from <f)(A), since the event £(m) holds. 

Noting that Y(m) 3> o(F) 2 (logn) 2 , we obtain 

E[^(F)(m)-A»(F)(m)] < -Lj ( £ N<t> (F°) - Q - e ) o(F) ■ Y(m) ■ N*(F)j , 
and hence, by Lemma 4.51, 

E[ C ^)( m) -^)W]^ (m) ' iS(F)(m) ^ (F) ° (F) 



Q(m) \logn 3 

since the event £ F (m) implies that JV^,(F)(m) ^ (| — e)L^(F)(m). 

Finally, recalling that o(F) > (since < < **) and that e(F) <C logn, observe 

that 

Y(m).L D A (F)(m) (e(F) o{F)\ 5o(F) t tD(jpV , 
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since the event Q(m) holds and 8/3 > 5/2. By (60) and (64), and since m ^ r, it follows 
that 

E MM] < (- 5 -^T-^ + (MF)+e). -L + ^).L^)(m) < 0, 
as required. 

The proof of the claimed bound on E[|AM^(m)|] is almost identical. Indeed, since the 
event C F (m) implies that N^{F){m) ^ L^(F)(m), repeating the calculation above gives 

E[|A«m)|] < E[C»(F)(m) + D t (F)(m)] + |AiJ(F)(m)| + £5 • L°(F)(m) 

« (((4 + E ) + (2 + £ )).^ + ^).Lj(F)M « L^*. L J (F)(m) 

as required. □ 

Finally, we are ready to prove Proposition 4.43. 
Proof of Proposition 4-43. We are required to prove that, for every tA(F) ■ n 3 / 2 ^ m ^ m*, 

p(M(m) c n^(ffl)nZ(ffl)nQ(m)) < n- 31ogn . (65) 

Recall that the event M.(m) c D £(m) implies that the event C F (r,s) holds for some pair 
(r, s). We claim that, for each faithful <p: A — > V(G m ) and each triple (m',r,s) G N 3 with 
tA(F) ■ n 3 / 2 ^ r ^ r + s ^ m' ^ m*, we have 

P(£j(r, s) n £{m!) n Z(m') n Q(m')) < n- (logn)2 . (66) 

By the union bound, this implies (65), and so will be sufficient to prove the proposition. 

In order to prove (66), we shall apply Lemma 3.1 to M F . Recall that the event C F (r, s) 
implies that M F (r + s ) ^ 4^py • L%(F)(r + s ), by Lemma 4.49, and set 

a = (hgn) A< ^ F - v ^- c and (3=-- L°(F)(r). 

Set K,(m) := P| C F {m!) D £(ra) fl Z(m) fl <2(m), and recall that if K,(m) holds then 

(a) \AM£(m)\ < a, by Lemma 4.48, 

(6) E[|AMj(m)|] < (3, by Lemma 4.50, since L A (F) is decreasing and s ^ rrm, 
(c) L A (F)(r + so) ^ /3soj since L A (F) is decreasing and o(F) > 0. 
Moreover, M^f is a super-martingale on [r, r + so], by Lemma 4.50, and 

a ■ f3 • so ^ C 2 ■ (\ogn) A ^ F - v ^- c ■ L D A (F){r + s ), 
since L®(F)(r) < C ■ L A (F)(r + s ), as noted earlier. Hence, by Lemma 3.1, we have 

F^M F (r + so) > -J^ ■ L D A {F){r + s )) n AC(m - 1)) < exp ( - (logn) 3 ) , 
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which implies (66). Finally, summing over choices 45 of and (r, s), the proposition follows. 

□ 

4.5. Bounding the maximum change in N^(F). We now return to the range u < t ^ 
t A (F), and use the event M(m) (defined in the previous section), together with the tools 
developed in Section 4.3, in order to bound the maximum possible single-step change of the 
function N^(F). The aim of this subsection is to prove the following lemma. 

Lemma 4.52. Let (F,A) be a graph structure pair, let ui < t ^ t A (F), and suppose that 
{\ogn)^ F A) < n v A (F)+e{F)+i_ ifS{m)f\M{m) holds, then 



\AN;(F)(m)\ < (logn)V^.^^L 

1 + Qf A ' 



for every 0: A — >• V{G m ) which is faithful at time t. 

We shall need one more straightforward lemma. 
Lemma 4.53. Let (F, A) be a graph structure pair. If A C H C F and u < t ^ t A (F), then 

Proof. Set a = 2v A (H) - e(H), b = o(H) and c = c(F, A). Note that 

ac ^ 2b 

by the definition (11) of c(F, A), that a ^ 1, since £a(F) > and H ^ A, and that 

N A (H)(m) = (2t) e{H) n a/2 e~ m \ 
by (22). Since = n^^e* 2 (\ogn)^ F ' A ) and n -V4 e rfA(F) 2 ^ x we obtain 

V 9F,A\t) J 

for every u> < t ^ tA(-F)) as required. □ 

We are now ready for a key calculation. 

Lemma 4.54. Let (F,A) be a graph structure pair, let u < t ^ t A (F), and suppose that 
(\ogn)^ F > A ) ^ n v A (F)+e(F)+i_ if S{m) f\ M{m) holds, then 

N^{F'){m) ^ (lognrV^) . ■ JV A (F)(m). (67) 

/or every (F', A') e Tp A and every 0': A' — >■ V{G rn ) which is faithful at time t. 



2 



45 Note that, since \A\ ^ (logn) 1 / 5 , we have at most n choices for </>. 
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Proof. We split the proof into three cases. 
Case 1: u < t < t A ,{F') and A' n F = A. 

The condition A' (1 F = A implies that 01 • N A >(F') < 2* • N A (F), by Observation 4.33. 
Suppose first that gF,A(t) < 1. Then, since u < t ^ t A >(F'), by the event £(m) we have 

Of 

N^F') < (l + ^, A ,(t))-iV A ,(F') < (logn)^ F '' A ') +1 -^.7V A (F) 



Note that we used Observation 4.39 and the fact that gF, A {t) 2 y/n ^ (logn) 27 ^'^ in the third 
step, and our assumption that <7f,a(0 ^ 1 in the fourth. 

On the other hand, if gF,A(t) ^ 1 then, since (logn) 7 ^ F,A ^ ^ n v M F )+ e ( F )+ 1 anc l u < t ^ 
min {t A (F),t A >(F')}, we have 

Of 

N(f>i(F') < (1 + ^ (*)) • Na>(F') < n 1 ' 4 * 26 • ^(t) • -= • N A (F) : 

v ^ 



by Lemma 4.42 and the event £{m). Noting that (log n)V A ( F ' A ) ^ n e , by Observation 4.41, 
it follows that 

W) < ^-i/4+3e _ gp A {t) • N A (F) < (logn)- 2 v /A W) ^(0 2 ^(F) 
since 5f,a(^) ^ 1, as required. 

Case 2: w < i < ti'(-F') and A' n F ^ A. 

By Observation 4.34 and Lemma 4.53, we have 

since A'flF^A Moreover, g F ',A'(t) ^ (logn) A(F ' A) ~ 3 v /A ( F ' A ) ) by Lemma 4.36 and Obser- 
vation 4.37. Using the event £{m) and the fact that lu < t ^ t A /(F'), it follows that 

AV(F') < (1 + ^(f)) • iV A ,(F') < - 9 [ At)2 u , ■ (logn)- 2 v /A ^) . N A (F), (69) 

as required, since 1 + g FtA (t) < (logn) 7 ^'^ 1 and 7(F, A) + a/A(F, A) > A(F, A) + 1. 
Case 3: i > t^(F'). 

Let A' C. H' C. F' be minimal such that t < tH'(F'), set H — H' C\ F, and suppose first 
that either H' ^ F' oi A' (1 F ^ A. Then, since S(m) holds and t > t A >(F'), we have 



ify(F') < (logn^'-^iMF') ^ (logn) A ^- 3 v /A (^)iV H (F). 
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where the second inequality follows from by Observations 4.34 and 4.37. Since H ^ A, by 
Observation 4.33, and N A (F) = N A (H) ■ N H (F), we may apply Lemma 4.53 to N A (H), as 
in (68), and hence obtain 

AV(F')(m) < 1 9 [ At) * ■ (logn)- 2 V^) ■ N A (F)(m), 

exactly as in (69). 

So suppose now that H' = F' and A' n F = A, let A' C H C • • • C H e = F' be the 
building sequence of (F',A'), and note that since if' = F', we have t ^ tg, by Lemma 4.6. 
If (F',A') is unbalanced then, by Lemma 4.11, Observation 4.37 and the event £(ra), 

N^F'Xm) ^ (logra)*^- 1 ^ < (logn^^v 7 ^^), 

which again implies (67), using Lemma 4.53 to bound N A (F). On the other hand, if (F, A') 
is balanced then, since the event M.{m) holds, we have 

AV(F')(m) < max{ e -°( F ')( t2 -^( F ') 2 )(logn) A ( F ' A '),(logn) A ( F '-^')}. 
By Observation 4.38, if 

iV(F')(m) < (hgn) A( - F '- v ^ < (logn) A ( F ' A )- 3 v / ^) 
then we are done as before, so let's assume that t A >(F') > 0, and that 

AV(F')(m) < e-^'^-^'^ihgn)^'^. (70) 

Note that moreover o(F) = o(F') > 0, since A' D F = A (see Definition 4.29). 
Next, we claim that if either gF,A{t) ^ 1 or c(F, A) = 2, then 

N,,(F>)(m) < (logn) A ^') < i 1 ^- * , ( 71) 

1 + 9F,A{t) 

by Observation 4.39 and the event £{m). Indeed, if gF,A(t) ^ 1 then this is immediate, and 
otherwise we have 

1 < gpAt) < n- £ (logny^ <: n- £ / 2 (logn)^ A )- 3 v /X ^), 

where the second inequality follows (for every t ^ t A (F)) since c(F, A) = 2, and the third 

holds because (logn)v^ A ( F,j4 ) ^ n £ / 6 , by Lemma 4.41. Using Lemma 4.41 once again, it 
follows that 

(log u) A ^ < n £ / 2 (logn) A ^ A ) < ^ , 

1 + gF,A\t) 

as claimed. The bound (67) now follows immediately from (71), using Lemma 4.53. 
It remains to deal with the case in which (70) holds, (F',A') is balanced and 46 

1 < g F ,A(t) < e^^^Qogn)^. (72) 



46 Recall that e c(f,A)t A (F) a < „i/4_ 
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where < t A >(F') < t < t A (F) ^ t* A (F) and c(F, A) > 2. We claim that 



t A {Ff-t A ,{F'f > (73) 

To see this, recall first that t* A ,(F') = t A >(F'), since (F', A') is balanced and t A >(F') < t*, and 
that o(F) = o(F'). Recall also from (28) that 8o{F)t* A (F) 2 / logn and 8o(F')t* A ,(F'f / logn 
are both integers. Since t A i(F') < t* A (F), it follows that these integers are distinct, and 
hence (73) holds. 

Suppose first that o(F)(t A (F) 2 — t A >(F') 2 ) > e ■ logn, and recall from Observation 4.16 
that c(F, A) ^ o(F)/v A (F). It follows that 

o(F)(t 2 -MFr)+c(F,A)(t A (F) 2 -t 2 ) > ^ry(tA(Ff-t A/ (F') 2 ) > (74) 
and by Observation 4.41 we have 

n sMF) ^ (logn) A(F',A')-A(F,A) + 3 v ^A) > (?5) 

Now, by (70) and (72) we have 

AV(F')(m) < e -o(F)(t>-t A ,(F>)2)( logn) A(Fi,A>) 

n no . r ,\A(F',A')+ 7 (F,A) 
^ e -o(F)(t>-t A ,(F'f )~c(F,A){tA{F) 2 ~ti) . \ l0 Sn) 

QfA 1 ) 

and hence, by (74), (75) and Lemma 4.53, and since gF,A(t) ^ 1, 



MOM < ^TZT* < i^nr^K ^ , A(FKm) , 

^+9F,A\ t ) L+9F,A{t) 

as required. 

Finally, suppose that o(F){t A (Ff - t A ,(F') 2 ) ^ e ■ logn. It follows from (73) that 

o(F){t* A (F) 2 -t A (F) 2 ) > Q-^logn, 

and hence, since 8o(F)t* A (F) 2 = (2v A (F) - e(F)) logn, by (28) 47 , we have 

N A (F)(m) = (2t) e ^-exp(4o(F)(^(F) 2 -t 2 )) > n 1 ' 2 '^ . 

Since gF,A{t) ^ n _1 / 4 (logn) 7( ^ i? ' j4 - ) , it follows, using (70), (75) and Lemma 4.41, that 
7V>(F')(m) < (logn) A(F '- A,) ^ n £ ■ (logn) 7(F ' A) < n 1/4+£ • g F<A {t) 

^ ^ _1 / 4 + 5e . g FA (t) ■ N A (F)(m) <: (logn)- 2 ^^) . ^ (t) * ■ N A (F)(m), 
since g F , A {t) > 1 and (logn)v^*^ ^ n £ , as required. □ 



Note that < t* A (F) < oo, since t A (F) > and o(F) > 0. 
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Using Lemma 4.54, we can now easily bound \AN^(F)(m)\. 

Proof of Lemma 4-52. By Lemmas 4.28, 4.30 and 4.54, the maximum number of copies of F 
rooted at 4>(A) which can be either created or destroyed by the addition of a single edge is 
at most 

\^,a U Ha\ ■ (logn)- 2 v^) . . N A{F )(m). 



Moreover, by Lemma 4.23, we have 

iAJvj(F) (m )i < 2 . ( |AAf *< F >< m)l + . ^ 

* \g F ,A(t)N A (F)(m) g F At) « 3 ' 2 
Since gF,A(t) > rT 1 ^ and t> (F) <C logn, it follows that 



(*) 



< (lognj-v 7 ^). g^g) 



as claimed. □ 

4.6. The land before time t — u. When t is bounded, the variables N<p(F) are not 
self-correcting, and so we cannot use the martingale technique introduced in Section 3. 
Fortunately for us, however, the faster-growing bounds given by the method of Bohman [10] 
suffice for our purposes. In this subsection we shall state the bounds we obtain in the case 
< t < bo < tA(F), and give an extended sketch of their proof. Since the ideas used in this 
section are not new, we postpone the details to the Appendix. 
Recall that 

f FtA (t) = e C(o(F)+l)( t » + l) n -l/4 (logn) A(F,A)- > /A(^ 

for each graph structure pair (F, A), and that K, £ (m) = y(m) r\Z(m) n Q(m). The following 
proposition shows that Theorem 4.1(a) is unlikely to be the first of our constraints to fail. 

Proposition 4.55. Let (F,A) be a graph structure pair, and let < t ^ u < tA(F). Then, 
with probability at least 1 — n ~ 3l ° sn , either [£{m -l)nM(m-l)(l JC £ (m — l)) c holds, or 

N^F)(m) G N A (F)(m) ± f FA (t) ■ N A (F)(n 3/2 ) (76) 
for every <f>: A — > V(G m ) which is faithful at time t. 

The proof of Proposition 4.55 relies heavily on the fact that the event y(m) gives us 
stronger bounds (in the range t ^ u) on the variables Y e than those given by the event 
£{m). Recall that 

f y {t) = e c ' 2 n- 1 /4 (logn) 5/2 and m = e -^ fy{t) . 

The following proposition is essentially due to Bohman [10], although he stated only a slightly 
weaker version of it; for completeness, we give a proof in the Appendix. 
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Proposition 4.56 (Bohman [10]). Let to < to ■ n 3 / 2 . With probability at least 1 — n~ 41ogn 7 
either (Z{m — 1) n Q(m — l)) c holds, or we have 

X e (m) G X{m) ± f x (t)X(n 3/2 ) and Y e (m) G Y(m) ± f y (t)Y(n 3/2 ) 
for every e G 0(G m ). 

Finally, let us note that y(m — 1) =>• Q(m) in the range to < u ■ n 3 / 2 . 
Proposition 4.57. For every m ^ u ■ n 3//2 7 ify(m — 1) /ioWs i/ien 



Q(m) Ge-^Q ± £ ./ w (t)Q). 



Proof. Recall that AQ(m) = —Y e {m) — 1, where e is the edge chosen in step to + 1 of the 
triangle-free process. Noting that, for every m' ^ cu ■ n 3 / 2 , we have 



m — 1 /■ n m — 1 

^F(m) G (l-e^Q )±n and £ / w (f) < - • / s (f). 

m=0 ^ ' m=0 

It follows that if y(m' — 1) holds, then 

Q(m') G (") " E (^H ± /«(*)^(n 3/2 )) C Q(m') ± £ • /#) (") - 

^ ' m=0 ^ ' 

as claimed. □ 

We shall next use Bohman's method to control the variables N^(F) in the range t ^ u. 
Let us fix a graph structure triple (F,A,(f>) with tA(F) > 0. The first step is to break up 
N^F) as follows: 

m'-l 

N^F)(m') = J2 ( C <p( F )( m ) ~ D*(F)(mj), (77) 

where C^(F)(m) denotes the number of copies of F rooted at <fi{A) which are created at 
step to + 1 of the triangle-free process, and D^F^m) denotes the number of such copies 
which are destroyed in that step. We shall need bounds on the expected and maximum 
possible single-step changes in C ( p(F)(m) and D^(F)(m). Since the proofs of these bounds 
are straightforward, and somewhat technical, we defer the details to the Appendix. 

Lemma 4.58. Let (F, A, 0) be a graph structure triple, and suppose that < t ^ ou < t^F), 
and that <f> is faithful at time t. If £{m) fl Q{m) holds, then 

i Gm] - ^hMpM e ± M'yw , 
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Lemma 4.59. Let (F, A, <p) be a graph structure triple, and suppose that < t ^ to < tA(F), 
and that <fi is faithful at time t. If £{m) fl y(m) fl Z(m) D Q(m) holds, then 

In order to apply Lemma 3.2, we shall also need bounds on C^(F)(m) and D^{F){m) 
which hold deterministically for all < t ^ u. 

Lemma 4.60. Let (F, A, 0) be a graph structure triple, and suppose that < t ^ to < t A (F), 
and that <p is faithful at time t. If £{m) fl M.{m) holds, then 

< C*(F)(m) < min |n £ , (logn) A ^/ 2 | • = N A {F){n z ' 2 ). 

Moreover, the same bounds also hold for _D^(F)(m) . 

We can now apply Lemma 3.2 to the variables C^F) and D^F); we again refer the reader 
to the Appendix for the full details. 

Sketch proof of Lemma 4-55. For each m e [m*], set /C(m) = £{m) n jM(m) n /C £ (m). We 
shall bound, for each m ^ w • n 3 / 2 , the probability that m is the minimal m6N such that 
/C(m — 1) holds, and 

A^(F)(m) £ A> A (F)(m) ± /^(t) • N A {F){n z ' 2 ) 

for some which is faithful at time t — m • n~ 3 / 2 . Note that the event in the statement of 
the proposition implies that this event holds for some m ^ u ■ r?l 2 . 
Fix mo ^ to ■ n 3 / 2 , and for each m! ^ mo, define random variables 

e(F) ■ N A (F)(m) ^ f F , A (t) ■ N A (F)(n 3 / 2 Y 



M±(m') = J2 

m=0 

and 

m'-l 

M±(m') = £ 



C^(F)(m)- v ' ""J" J ± 



t ■ n 3//2 n 3 / 2 



m=0 



n (M y 8t-o(F)-iV A (F)(m) C ■ o(F) ■ (t + 1) ~ 3/2) 



It follows from Lemmas 4.58 and 4.59 that, while the event £(m) fl y(m) fl Z(m) fl Q(m) 
holds, and are both super-/sub-martingale pairs. Now, set 

a = ( nJn {„-, (log,,)*™.} • "° S "l^^^ + liM) . N A ( F) (n^ 

and 

/ (l0 gn) ^) + /^MV 
y n 6 ' 2 m / 

By Lemma 4.60, we have 

-p < AMp(m) + AMp(m) < a 
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while £(m) n M(m) holds. Moreover, since f F , A (t ) ^ n- 1 / 4 (logn) A(F ' A)_ v /A ( i? ' A ) ) and we 
may assume that m Q ^ n £ , we have 

a- (3 -mo /f,a(*o) 2 
N A (F)(n 3 / 2 ) 2 " (logn) 4 ' 
Hence, by Lemma 3.2, we obtain 

P^M^(mo) > ^/^(t )iV A (F)(n 3 / 2 )^ n /C(m - 1)) < e"^") 3 , 

and similarly for M^, and Mj. 

Note that the number of choices for 6 is negligible, since \A\ ^ (logn) 1/5 . Via a straight- 
forward calculation it follows that, with probability at most n ~ clogn , 

mo ~ 1 / N 
N^(F)(m ) = [C^F){m) - D^F){m)) <£ N A (F)(m ) ± f F , A (t )N A (F)(n 3 / 2 ), 

m=0 

for some faithful 0: A — > V(G rn ), as required. □ 

4.7. Proof of Theorem 4.1. Finally, let us put the pieces together and prove Theorem 4.1 
using the martingale method introduced in Section 3. We shall break up the event that the 
theorem fails to hold into four sub-events, depending on whether the first graph structure 
triple which goes astray is tracking and/or balanced. 

Definition 4.61. For each m G [m*], we define events £i(m), 82(171), 83(111) and £4(171) as 
follows: 

(a) S\(m) denotes the event that £(m — 1) n M.(m — 1) n K, (m — 1) holds, and that 
there exists a graph structure triple (F, A, 0) with < t ^ u < t A (F) such that 

N^F)(in) £ N A (F)(in) ± f FA (t)N A (F)(n 3 / 2 ), 

and is faithful at time t. 

(b) £2(111) denotes the event that £(in — 1) n M(in — 1) n JC £ (m — 1) holds, and that 
there exists a graph structure triple (F, A, 0) with 00 < t < t A (F) such that 

N^F)(m) £ (l±g FA (t))N A (F)(m), 

and is faithful at time t. 

(c) £3(111) denotes the event that £(m — 1) D K, £ (m — 1) holds, and that there exists a 
balanced graph structure triple (F, A, 0) with t > t A (F), such that either 

A^(F)(m) > max{e-°( F )(* 2 -^( F ) 2 )(logn) A ( F ' A ),(logn) A ( F -^} 

and is faithful at time t, or t A (F) = 0, 

A^(F)(m) > (logn) A ^ F - v ^ 
and is faithful at time t. 

48 More precisely, a first, since there could be several which go astray at the same step. 
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(d) £4(171) denotes the event that none of the events £i(m), 82(171) and 83(771) holds, but 
the event £(m — 1) D M.(m — 1) fl K £ (m — 1) holds, and there exists an unbalanced 
graph structure triple (F, A, <p) with t > tA(F), such that 49 

A^(F)(m) > (logn) A{F > H ^N H (F)(m + ), 

where A C H C F is minimal such that t < t H (F), and is faithful at time t. 

It is easy to see 50 that 

[J £(m) c n /C £ (m - 1) C |J ^(m) U £ 2 (m) U £ 3 (m) U £ 4 (m), 

m=l m=l 

i.e., if the conclusion of Theorem 4.1 fails, then one of £i(m), £ 2 (m), £3(111) and £4(171) must 
hold for some m e [m*]. It will therefore suffice to bound the probabilities of these events. 
The next three lemmas do so; the first follows immediately from Propositions 4.43 and 4.55. 

Lemma 4.62. For every m e [m*\, 

P(£i(m))+P(£ 3 (m)) < 2-n- 31ogn . 

In the next lemma, we use the martingale argument from Section 3 to control N^F) in 
the range u < t < t A (F). 

Lemma 4.63. For every m e 

P(£ 2 (m)) < n- clogn . (78) 

Proof. Noting that P(£^2(^)) = if t ^ cu, let u> < t' ^ £*, set m' = t' -n 3//2 and suppose that 
£2(171') holds. Let (F,A,<p) be the graph structure triple for which \N1(F)(m')\ > 1, and for 
which u < t' ^ t A (F) and <fi is faithful at time t'. Moreover, observe that 

(logn)^ F ' A ) ^ n v A {F) + e(F)+l^ (7Q) 

since otherwise the bound in Theorem 4.1(6) (and hence £ 2 (m')) holds trivially, and that the 
event £{w! — 1) fl M.(m' — 1) fl K, £ (m' — 1) holds. We shall apply the martingale method 
introduced in Section 3 to the self-correcting variable N^F), and then sum over choices of 
(F, A, <p) and wl . 

We begin by choosing a family of parameters as in Definition 3.4. Set )C(m) = £(m) D 
M(m) H y(m) H Z(m) H Q(m) and / = [a, b] = [u ■ n 3 / 2 ,t A (F) • n 3 / 2 ], and let 

a(0 = (bgn)-v^.-^^ 

and 

C-logn 1 + ^ A (0 



m = 



n 3 ' 2 g F , A (t) 



Here, as in the statement of Theorem 4.1, we set m + = max{m, n 3 / 2 }. 
50 Indced, simply consider the minimum to such that £(m) c n IC £ (m — 1) holds, and observe that if 
M(m — l) c holds then £3(171 — 1) does too. 
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Moreover, set A = C ■ c(F, A), 5 = e and h(t) = t ■ n~ 3//2 , and note that a and (5 are A-slow 
on [a, b], and that a(t) < e 2 and 51 

min {«(*), (3{t), hit)} > 

for every u < t ^ t^F). Hence (A, 5; Qf,Aj h; a, (3; /C) is a reasonable collection. 

We claim that N^(F) satisfies the conditions of Lemma 3.5 as long as <fi is faithful. In- 
deed, Lemma 4.18 implies that N^(F) is (gF,A, h; /C) -self-correcting, since c(F,A) ^ 2, and 
Lemmas 4.26 and 4.52 imply that 



\AN;(F)(m)\ < a(i) and E[|A7V;(F)(m)|j < /3(t) 

for every oo < t ^ tA(F) for which K(m) holds. Moreover, the bound |A^(F)(a)| < 1/2 

follows from the event £(a), since f Ft A{u)N A (F)(n 3 / 2 ) ^ i A(w)iV J 4(F)(a). 
Finally, observe that, since e(F) + o(F) <C logn, we have 

a{t)(5{t)n" 2 < (log^^Oog^-v 7 ^) < ^-L^ 

for every w < t < tA(F). By Lemma 3.5, it follows that 

p((|iV;(F)(m')| > 1) n K(w! - 1)) < n 4 • e"^") 4 < n -( lo §«) 2 . 
Summing over our choices 52 for (F,A,<f>) and m', we obtain (78), as required. □ 

Finally, let's use the building sequences introduced in Section 4.4 to show that an unbal- 
anced non-tracking graph structure cannot be the first to go astray. 

Lemma 4.64. For every m G [m*], 

P (^(m)) = 0. 

Proof. Let m G [m*], and let (F,A) be an unbalanced graph structure pair with t > t A (F). 
We claim that if (£i(m) U £ 2 (m) U £ 3 (m)) c and £{m - 1) n .M(m - 1) n /C £ (m - 1) both 
hold, and if 0: A — > V{G rn ) is faithful at time t, then 

A^(F)(m) < (logn) A(F ' H ' A) iVH(F)(m + ), (80) 

where A C if C F is minimal such that t < tn(F), and m + = max{m, n 3//2 }. Since (F, A, <p) 
was arbitrary, this implies immediately that P (£4(771)) = 0, as required. 
In order to prove (80), let the building sequence of (F, A) be 

A C H C ■ ■ ■ C Hi = F, 

and recall that £j = t*^. (Hi) for each % G and that = to < t\ < • • • < t£ ^ 00, by 
Lemma 4.5. Suppose that tj ^ t < tj+i, and recall that Hj is the minimal A C H C F 



51 Hcre we use Lemma 4.41 together with (79) to show that a(f) ^ n E gp A (t) ^ n 1 / 4 e . 
52 0ncc again, by our assumed bound on u(-F) + e(F) + o(F), the number of choices is negligible. 
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such that t < t H (F), by Lemma 4.6. Recall also that each pair (H ,A) and (H i+1 , Hi) is 
balanced, by Lemma 4.9. Thus, since S(m — 1) fl JC £ (m — 1) fl 83(171)° holds, we have 

Nt(H )(m) ^ (logn) A ^- A ) and A^(# m )(m) ^ (logn) A(H ^ H ^ 

for every ^ i ^ j — 1, and every map fa : iij — » V(G m ) which is faithful at time t. Since 

iV (F) ^ N^Hq) (I] max A^tf^i)") • max N^(F) 

by Lemma 4.10, where the maxima are over faithful maps fa, and 

j'-i 

A(i/ ,A) + ^A(iJ m ,iJ J ) < A(^,A), 

i=0 

by the definition (23) of A(F, A) and the convexity of the function x h-> x c , it follows that 
A^(F)(m) < (logn) A ^>. max N^(F)(m). (81) 

Finally, we claim that, 

N^(F)(m) < (logr i ) A ^)^.(F)(m + ). (82) 

To prove (82), suppose first that t ^ u. Then, since £(m — 1) fl lC £ {m — 1) fl £i(m) c holds 
and £ < = t* Hj (F), we have 53 

A^.(F)(m) < iV^(F)(m) + /^(t)iV^(F)(n 3/2 ) < (\ogn) A ^-^ A ^N Hj (F)(m + ). 

On the other hand, if t > 00 then, since S(m — 1) fl .M(m — 1) fl JC £ (m — 1) fl S 2 (m) c holds 
and t < t,- +1 = t* Hj (F), we have 

A^.(F)(m) < (l + ^ iif ,(t))7V if .(F)(m) < (log n) A ^N Hj (F)(m + ), 
as claimed. Combining (81) and (82), we obtain 

A^(F)(m) ^ (\ogn) A ^^ +A ^ H ^N H .(F)(m + ) = (\ogn) A ^^ N Hj (F)(m + ), 
as required. □ 
Proof of Theorem Since, as noted above, we have 

[J £{m) c n K. £ (m - 1) C (J ^(m) U £ 2 (m) U £ 3 (m) U £ 4 (m), 

m=l m=l 

the theorem follows immediately from Lemmas 4.62, 4.63 and 4.64. □ 



5:5 



To see this, simply note that N Hj (F)(m)+N Hj (F)(n 3 / 2 ) < ( e 4 " 2 °( F ) + l) N H . (F)(m+) for every t < uj. 
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5. Tracking Y e: and mixing in the F-graph 

In this section we shall show how to use the events £{m) and X(m) to track Y e for each 
open edge e. Set a = 00 ■ n 3 / 2 and 

K y {m) = S(m) n X(m) n y(a) H Z{m) n Q(m) 

for each m G [to*], and recall that g y (t) = e 2t2 ra -1 / 4 (logra) 4 and that n ^ n (e,C, u) is 
chosen sufficiently large. The aim of this section is to prove the following proposition. 

Proposition 5.1. Let u ■ v?! 2 < m ^ m* . With probability at least 1 — n~ closn , either the 
event K, y (m — l) c holds, or 

Y e (m) G (1 ± g y (t)) ■ 4t^e~ At2 (83) 
for every open edge e G 0{G m ). 

We begin by defining a collection of variables V° which are averages of the variables Yf 
over various multi-sets of edges. Given two open edges e, / G 0(G m ) ) recall that e and / are 
said to be Y -neighbours in G m if / G F 6 (to) and (equivalently) e G Y/(m). 

Definition 5.2 (The V-graph). Given the graph G m at time t, define the Y -graph of G m 
to be the graph with vertex set 0{G m ), whose edges are pairs of F-neighbours in G m . 

Note that when we take a step in the V-graph, one of our endpoints remains the same, 
and the other changes. We shall therefore imagine ourselves walking through the y-graph, 
with a foot on each endpoint of the currently occupied open edge, and a step being taken 
either with the left or the right foot. Let us call the sequence of left / rights the type of a 
walk; we shall need to differentiate between paths in the F-graph of different types. 

Let k := |~3/e] be a constant fixed throughout the proof. We say that a sequence a = 
(<7i, . . . ,ai) of lefts and rights 54 is k-short if every string of consecutive lefts or rights has 
length at most k, and there are at most k 'changes of foot', i.e., 

<7 i+ i = . . . = a i+j =>- j < k and | {i G \i - 1] : Oi ^ cr i+1 } | < k. 

Note that if a is /c-short then \a\ ^ k(k + 1). The idea behind this definition is that if we 
take more than k steps in a row with the same foot, then we will be very well 'mixed' in the 
open neighbourhood of our planted foot, while if we alternate feet more than k times, we 
will be well mixed in the whole graph. 

Before continuing, let us make an important remark. In this section, we shall write e 
to denote an open edge of G m with an orientation; that is, we assign the symbols L and R 
to the endpoints of e. This orientation 55 will be inherited by the F-neighbours of e in the 
obvious way: if we write / G Y e (m), then the label of the vertex v G e fl / (as an endpoint 
of /) is the same as its label an endpoint of e. Moreover, let us write 

Y e L (m) = {/ G Y e (m) : the vertex v G e \ / has label l\, 



We shall typically use the letter t to denote the length \o\ of such a sequence a. 
55 In particular, we emphasize that the same edge may have different orientations at different points in 
the proof, and even in the same cr-walk. In other words, we denote by e an (edge, orientation)-pair. 
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and set Y e R (m) = Y e (m) \ Y e L (m), and define X e L (m) and Xf (to) similarly. 56 

Given open edges ei,e2 G 0(G m ) (with orientations) and a sequence a G {L,R}*, where 
{L, R}* denotes the set of finite strings of lefts and rights, we shall say that a sequence W 
of open edges in G m is a a-walk from e x to e 2 if the following hold: 

W = (f , /i, . . . , /h), f = ei, = e 2 and G ly^m) for ever y 1 ^ * ^ l°i 

That is, f\ and are V-neighbours in G m , and the vertex i> G fi \ fi-i has label <7j. 
We can now define the random variables which we shall need to track. 

Definition 5.3. Let o G {L,R}*, let to G [to*] and let e G 0(G m ). We shall write 

u:(m)= E E ••• E ^ 

/ ie y e CT1 (m) /aey^ 2 (m) /^Y^ (m) 

to denote the number 57 of cr-walks starting from e, and 58 

v ' {m) = ukn) E E •■■ E iw-o. 

for the average of the F-values reached via such walks. 

In particular, we note that if |<r| = then the unique cr-path starting from e is W = (e), 
and hence U°{m) = 1 and V°(m) = Y e (m). For each a G {L,R}*, set 

g*(t)=e\*\g y (t). 

We emphasize that this error bound improves exponentially as \a\ increases; this property 
will be crucial in the proof below. 

We shall prove the following generalization of Proposition 5.1. It says that, for any /c-short 
sequence a, with (very) high probability V° is tracking up to time t*. 

Proposition 5.4. Let cu ■ n 3 ^ 2 < m ^ m* . Then, with probability at least 1 — n~ clogn , either 
JC y (m — l) c holds, or 

V:(m) G (1 ± g a (t)) ■ Ate- At2 ^i (84) 
for every e G 0{G m ), and every k-short sequence a G {L,R}*. 

In order to prove Proposition 5.4, we shall need the following bounds on U°{m), which 
may easily be proved using the method of Section 3.4, see the Appendix for the details. 

Proposition 5.5. Letu-n 3 / 2 < to < m* . Then, with probability at least l—n~ clogn , either 
lC y {m — l) c holds, or 

Y e L (m) G (l±^(t))-(2te- 4 ' 2 v^) (85) 
for every e G 0{G m ), and hence 

U;(m) G (1 ± 9x (t)) lal • (2te- 4t2 v^) H (86) 



56 So, for example, X^(m) = {/ G X e (m) : the vertex v G e \ / has label L}. 

57 Later, we shall also write U^(m) to denote the multi-set of open edges reached via a a- walk from e. 
58 As usual, if e g 0(G m ) then we set U°(m) = U°(m - 1) and V e a (m) = V°(m - 1). 
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for every e G 0{G m ), and every sequence a G {L,R}*. 

We shall denote by U(m') and V(m') the events that the inequalities (85) (resp. (84)) hold 
for every u ■ n 3 / 2 < m ^ m 1 , every e G 0(G m ) and every /c-short sequence a G {L,R}*. 
Recall that V e (m) = Y e (m) = U^(m) + U R (m), and note that therefore Proposition 5.4 
immediately implies Proposition 5.1. It will also be convenient in this section to write ^P(m') 
for the event that (83) holds for every to ■ n 3 ^ 2 < m ^ m' , since 

V(m) =4> j)(m), 

and we shall need to assume that y{m) D 3^(^) holds in several of the lemmas below. 

The proof of Proposition 5.4 is roughly as follows. First we show that by taking k steps 
with the left foot (say), we 'mix well' in the open neighbourhood of our right foot. More 
precisely writing Q u (m) for the collection of open edges incident to vertex u, and assuming 
that our right foot is fixed at u, we shall prove that V e Lk (m) is within a factor of 1 ± o(g y (t)) 
of the average of Yf(m) over / G Q u {m)- We also prove a similar bound on V°(m) for any 
(bounded length) sequence a which changes foot at least k times. The only difference is 
that we now mix well in the entire V-graph, i.e., we prove that V°(m) is within a factor of 
1 ± o(g y (t)) of Y(m). Both mixing results follow from the event £{m) (see Theorem 4.1), 
applied to the graph structures which correspond to a- walks from e to /. 

We next control the one-step changes in V° for each /c-short sequence a, and show that 
the variable V e CT is self-correcting, assuming that none of the other variables have yet gone 
astray. The mixing results above are a crucial tool in this calculation, since the rate of 
change of V e °" depends on V° L and V° R , and these longer sequences 59 may not be /c-short. 
Finally, we apply the method of Section 3 to bound, for each /c-short sequence a G {L, R}*, 
the probability that V" is the first variable to cross its Line of Death. 

5.1. Mixing inside open neighbourhoods. Our aim over the next few subsections is to 
show that V e CT is self-correcting; we do so by writing E[AV e °"(m)] in terms of V° L {m) and 
V/ R (m), which we can control as long as both o~L and o~R are also /c-short. In the next two 
subsections we shall deal with the other case. 

The key lemma of this subsection is as follows; it says (roughly) that a walk of length k 
in the open neighbourhood of a vertex is well-mixed. 

Lemma 5.6. Let u ■ n 3//2 < m ^ m* . If S(m) fl j^(^) holds, then 

K k (m) - V e Lk+ \m)\ = o(g y (t)Y(m)) (87) 

for every open edge e G 0(G m ). 

Note that the o(-)-term in the lemma indicates that the left-hand side of (87) divided by 
g y (t)Y(m) is (uniformly over times u < t ^ t*) at most some function of n which tends to 
zero as n — )• oo. In particular, this implies that if n ^ n (e,C,uj), then the left-hand side 
of (87) is smaller than g a {t)Y{m) for every /c-short sequence a. 



59 If a — (oi, . . . , at) then aL = (oi, . . . , at, L) and aR = (<7i, . . . , at, R). 
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We shall deduce Lemma 5.6 from the following lemma. Recall that Q u {m) denotes the 
open neighbourhood of the vertex u in G m , i.e., Q u {jn) = {e G 0(G m ) : u G e}. 

Lemma 5.7. Let to ■ n 3 ^ 2 < m < m* . If £(m) D j)(m) /io/ds, t/ien 

Ke (m) Q u ( m ) ^ F/(m) 

/or every open edge e G Q u {jn). 

In order to prove Lemma 5.7, we shall use the following simple facts. 60 
Observation 5.8. Let a±, . . . , a r+s G (1 ± 7) a, and suppose that s ^ 5r. TTien 

G (1±0( 7 5)) 

Observation 5.9. Lei 01, . . . , a r G (1 ± 7)0 and b 1: . . . , b r G (1 ± 5)b. Then 

r ■ E a i b i e ( 1±0 (^))(l>)(X>)- 

j= i \ j=i / \j=i / 

We shall also need a little extra terminology. 

Definition 5.10 (a-paths and a-cycles). Let a G {L, R}* and e, / G 0(G m ). A a-walk from 
e to / which leaves each endpoint of e at most once, and arrives at every other vertex at 
most once, is called a a -path. If moreover e = f, then it is also called a a-cycle. 

Let us write P°{m) for the number of a-paths starting at an open edge e, and recall that 
U°{m) denotes the number of a-walks starting at e. We shall use the event £(m) to bound 
the number of a-paths (or cycles) between two edges e, / G Q u {m). 

Finally, we make an observation which will be useful several times in this section. 

Observation 5.11. Let cu ■ n 3 / 2 < m ^ m* . If S(m) holds, then 

-^•f(m) H ^ U a e {m) ^ f(m) H 
o 

for every a G {L, R}* with \a\ ^ to. 

Proof. Simply note that, since the event £{m) holds, the variables Y^{m) and Y^{m) are 
each at most Y(m), and at least Y(m)/3, for every / G 0(G m ). □ 

Proof of Lemma 5.7. Let cu ■ n 3 / 2 < m < m*, recall that = [3/e], and set a = 
Moreover, let n G l / (G m ) and e G <3 u (m). We claim that 

(a) the number of a-paths from e to / is the same, up to a factor of 1 ± o(l), for each 
open edge / G Q u (jn), and 

(6) almost all a-walks starting at e are in fact a-paths. 
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In both observations, all variables are assumed to be positive real numbers. 
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Using Observations 5.8 and 5.9, the lemma will follow easily from (a) and (b). 

Suppose first that e^/, and consider the graph structure triple (F, A, <fi) which represents 
a cr-path from e to /, see Figure 5.1(a). Note that v(F) = k + 2, \A\ — \e U /| = 3, e(F) = k 
and o(F) —k — 1. We claim that t A (F) = t* . 




Figure 5.1. L fc -paths from e to /. 

To prove the claim, recall from Definition 2.10 and (28) that if tA(F) < t*, then there 
exists an induced sub-structure A C H C F such that either e(H) ^ 2va(H) or 

Zt^y^fa-.)^-,. (SB, 

Note that e(H) ^ v A {H) + 1 H=F and o(H) = v A (H). Since v A (F) = k - 1 > 1/e > 1, it 
follows that e(H) < 2v A (H) and 

2v A (H) - e(H) 1/ _J_\ / J_ 
8o(i7) ^ 8 V k — 1 J \2V2 

and so ^(F) = £*, as claimed. Hence, assuming £{m) holds, and noting that <fi is faithful, 
we have 

N^F) G (l±o(l))N A (F), (89) 

since ^y(F,A) ^ u, and so gF,A(t) ^ «.~ e for every 00 < t < f . 

On the other hand, if e = / then consider the graph structure triple (F',A',(fJ) which 
represents a cr-cycle from e to itself, see Figure 5.1(6). Since k > 2, the structure (F',A') is 
obtained from (F, A) by identifying two vertices of A with no common neighbours, and it 
follows that Na'(F') = N A (F), and that 

AV(F') e {l±o(l))N A ,(F') G (l±o(l))iV A (F), 

exactly as above. 
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It follows that the number of cr-paths from e to / is within a factor of 1 ± o(l) of Na(F) 
for every / G Q u (to). It remains to observe that, assuming £{m) holds, 

~ Y( 7Tl) k 

U°(m)-P°(m) < uj-Y(m) k - 1 < \ ' < U°{m). (90) 

To see the first inequality, simply sum over the (less than) k 2 < tu 2 choices for the two steps 
1 ^ % < j ^ k which use the same vertex, and note that, since the event £{m) holds, we 

have at most (l + o(l))^y^ < Y{m) choices 61 for each of the steps except j, and at most 
one choice for step j (given step i). The second inequality holds since u 2 ■ 3 W <C n £ ^ Y(m), 
and the third holds by Observation 5.11. 

Hence, writing (to) for the average of Yf(m) over the multi-set of open edges / reached 
by cr-paths from e, we claim that 

rtm)6(i±«)-rwc^M e y /(™)> 

as required. Indeed, the first inclusion follows from Observation 5.8, using (90) and the event 
y(m), by enumerating the a- walks from e, and setting aj = Yf(m) if the j th walk ends at 
edge /. The second inclusion follows by Observation 5.9 and the event JP(to), by setting (for 
each / G Q u ) a/ equal to the number of cr-walks from e to /, and bf = Yf(m). □ 

Noting that Lemma 5.7 also holds for a = L k+1 , it is now easy to deduce Lemma 5.6. 

Proof of Lemma 5.6. By Lemma 5.7 we have 

K\m)-Vr\m)\ < °-^§ JJ YM) . 

' feQ u (m) 

Now Yf(m) G (l ± o(l))F(m) for every / G Q u { m ), by !V(to), so the lemma follows. □ 

5.2. Mixing in the whole F-graph. The other way in which oL or oR might not be 
/c-short is if a changes foot more than k times. The following lemma follows from the fact 
that such a a-walk 'mixes fast' in the entire F-graph. Let's write s(o~) = \{i : o~i ^ for 
the number of 'changes of foot' during a a- walk. 

Lemma 5.12. Let to ■ n 3 ^ 2 < m ^ m* , and let a G {L, R}* be a sequence with \a\ ^ to and 
s(a) ^ k. If £{m) fl y(m) holds, then 

K(m)-V? L (m)\ = o(g y (t)Y(m)) 

for every open edge e G 0(G m ). 

We shall deduce Lemma 5.12 from the following lemma. 



61 



We shall use this bound, Y e L (m) < Y(m), several times below without further comment. 
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Lemma 5.13. Let to ■ n 3 / 2 < m ^ m* , and let a G {L,R}* be a sequence with \o~\ ^ to and 
s(a) ^ k. If £{m) r\y(m) holds, then 

\V:(m)-Y(m)\ = o(g y (t)Y(m)) 

for every open edge e G 0(G m ). 

Proof The proof is similar to that of Lemma 5.7, above. Indeed, let to ■ n 3//2 < m ^ to*, let 
e G 0(G m ), and let a G {L, R}* be a sequence with \a\ < u> and s(a) > k. We claim that 

(a) the number of a-paths from e to / is the same, up to a factor of 1 ± o(l), for each 

open edge / G 0(G m ), and 
(6) almost all cr-walks starting at e are in fact a-paths. 

The second claim again follows by (90), so we shall need to prove only the first. 

Suppose first that e and / are disjoint, and consider the graph structure triple (F, A, (f>) 
which corresponds to a cr-path from e to /, see Figure 5.2(a). We claim that tA^F) = t*. 




Figure 5.2. a-paths from e to /. 

To prove the claim, recall from the proof of Lemma 5.7 that if tA(F) < t*, then there 
exists an induced sub-structure A C H C F such that either e(H) ^ 2va{H) or (88) holds. 
We claim first that if e and / are in the same component of H, i.e., that there is a path in 
E(H) U 0(H) from an endpoint of e to an endpoint of /, then 

v A {H) > s(ff) 2 ~ 3 ^ ^ e (H) < v A (H) + 2 and o(H) ^ v A (H) + 2. 

To prove the bound on va(H), simply note that each consecutive pair of 'pivot' vertices in 
F form a cut-set which divides e from /, and so H must contain at least one of each such 
pair. To prove the bounds on e(H) and o(H), consider the vertices of H according to the 
order in which they are reached by the cr-walk. Each vertex sends at most one edge and one 
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open edge backwards, and thus e(H) ^ va(H) + 2 and o(H) ^ va(H) + 2, as claimed 62 . It 
follows that e(H) < 2va(H), since va(H) > k/3 > 2, and moreover 

2 M H)-e { H) > 1 A»,W-2^ ^ 1 / _ _12_\ > / 1 _ V (M) 



8o(#) 8VM#) + 2 / 8\ A: + 6 y VSv^ 

for all such H, since fc + 6 > 3/e. 

On the other hand, suppose that e and / are in different components of H. Then we 
may apply exactly the same argument to each component, and obtain the stronger bounds 
e(H) ^ va(H) and o(H) ^ va(H) (component- wise, and thus also in H). Thus 2va(H) — 
e(H) > o(H), and so (91) still holds, i.e., t* A (H) > t*. Hence t A (F) = t*, as claimed. 

Now, since |<x| ^ to, it follows that ^(F, A) ^ (logn)°^\ and so gF,A(t) ^ n~ £ for every 
cj < t < t*. Since £(m) holds and is faithful at time t, it follows that 

N^F)(m) e (l±o(l))N A (F)(m). (92) 

The proof when e and / are not disjoint is similar. Indeed, consider the graph structure 
triples (F r , A', </)') and (F", A", </)") which represents a a-path from e to / when \eC\f\ = 1 and 
when e = /, respectively, see Figure 5.2(6) and (c). We claim that tA'(F') = t A "(F") = t*. 
Indeed, since s(a) > 2, it follows that (F',A') and F",A") are obtained from (F,A) by 
identifying vertices of A with no common neighbours. Thus Na'(F') = Na"{F") = N A {F), 
and hence we obtain 

ify(F') G (1 ± o(l))7V A (F) and N r {F") e (l ± o(l))iV A (F), 

exactly as above. 

We have proved that the number of cr-paths from e to / is within a factor of 1 ± o(l) 
of N A (F) for every / e 0(G m ). Hence, writing ^/(m) for the average of Yf(m) over the 
multi-set of open edges / reached by u-paths from e, we claim that 



V:(m) e {l±o(g y (t)))V e a (m) Q (l ± o(g y (t)))Yi 



m) 



as required. Indeed, the first inclusion again follows from Observation 5.8, using (90) and 
the event y(m), and the second inclusion follows by Observation 5.9 and the event y(m), 
exactly as before. □ 

It is now easy to deduce Lemma 5.12. 

Proof of Lemma 5.12. By Lemma 5.13, we have 

\V:{m)-V: L {m)\ < o(g y (t)Y(m)), 

and since j)(m) holds, we have Y(m) G (l±o(l))Y(m). The lemma follows immediately. □ 



62 



Note that the +2 term corresponds to the two endpoirits of /. 
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5.3. Creating and destroying a-walks. In this subsection we shall bound the maximum 
possible size of |AV^ CT (m)|; not only will we need this bound in the martingale calculation, 
but the walk-counting lemmas below will be useful in proving self-correction. 
The main result of the subsection is as follows. 

Lemma 5.14. Let to ■ n 3 / 2 < m ^ m* . If E(m) D Z(m) holds, then 

\AV e a (m)\ < (logn) 4 

for every e G 0(G m ) and every a G {L, R}* with \a\ ^ ui. 

We begin with a straightforward observation, which will allow us to control AU°(m). 

Observation 5.15. Let m < to*. If Z(m) holds then, for every e, / G 0(G m ) with e//, 
there are at most (logn) 2 walks of length two in the Y -graph from e to f . 

Proof. If e and / are disjoint, then there are at most four walks of length two from e to /, 
so assume that e = {u, v} and / = {u, w}. Then the first step must take us to an open edge 
h = {u, x} with {v,x} G E(G m ) (since h G Y e (m)) and {w,x} G E(G m ) (since h G Yf(m)), 
i.e., a; is a common neighbour of v and w. Since event Z{m) holds, there are at most (logn) 2 
such vertices, and so there are at most (logn) 2 such walks, as claimed. □ 

We can now bound the number of a- walks between any two open edges of G m . 

Lemma 5.16. Let u ■ n 3/2 < to < m* , let e, f G 0(G m ) with e^/, and let a G {L, R}* 
with \a\ ^ 2. If E(m) fl Z(m) holds, then there are at most 

2-(logn) 2 -F(m)l (T l- 2 

a-walks from e to f . 

Proof Since E(m) holds, it follows that Y^{m) ^ Y(m) for every e G 0(G m ), and hence we 
have at most Y{m) choices for each step of a cr-walk on G m . Let h be the open edge we have 
reached after |<r| — 2 steps, and consider two cases: either h — f, or h ^ f. 

If h = f, then the bound is easy: we have only one choice for steps \a\ — 2 and |<r| (since 
both must land on /), and so there are at most Y(m)^~ 2 such walks. On the other hand, 
if h ^ f then by Observation 5.15 there are at most (logn) 2 choices for the final two steps, 
since Z(m) holds. Hence there are at most (logn) 2 • F(m)' CT '~ 2 such walks, as required. □ 

We can now easily bound |A[/f(m)|. 

Lemma 5.17. Let u ■ n 3 / 2 < m < m* . If E(m) fl Z(m) holds, then 

\AU°{m)\ ^ \a\ ■ (logn) 3 - Y(m)^- 1 
for every e G 0(G m ) and every a G {L, R}* . 

Proof. Note first that if |c| = then the lemma is trivial, since AU^(m) = 0. Moreover, 
if \a\ = 1 then we have U° = Y e L or U° = Y e R , and it is easy to see (cf. Section 4.3) that 
— (logn) 2 ^ AY e L (m) ^ 1, since Z(m) holds, and so the lemma follows in this case also. 
Therefore, let us assume from now on that \a\ ^ 2. 
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Let / denote the edge added in step m + 1 of the triangle-free process, and note that if 
either e = / or / e Y e (m) then we are done, since 63 in that case AU°(m) = 0. So assume 
that e^ / and e (jL Yy(m), and observe that a a- walk is destroyed by the addition of / only 
if it passes through a F-neighbour of /, and is created only if it passes (in consecutive steps) 
through both of a pair of X-neighbours of /. 

We claim that at most \a\ ■ F(m)l°"l _1 c-walks from e are created in step m + 1, and that 
at most 

M 

(4 • (logn) 2 • Y (m) 1 - 1 ■ Y (m)M- r ) <; \ a \ . (\ ogn f ■ Y^m)^ 1 (93) 

r=l 

such walks are destroyed in the same step. To prove the former bound, simply note that 
one of the steps (the one between the X-neighbours of /) is pre-chosen and that, since S(m) 
holds, we have at most Y(m) choices for each of the other steps. To prove that (93) is 
an upper bound on the number of destroyed walks, let r denote the number of steps after 
which we first reach a F-neighbour of /, and note that r ^ 1 since e ^ Yf(m). Now, by 
Lemma 5.16, since e ^ / and S(m) D Z(m) holds, we have at most 4 • (logn) 2 • Y(m) r ~ l 
choices for the walk 64 from e to some V-neighbour of /. 

This proves the claimed bounds on the number of a-walks created or destroyed in a single 
step. The lemma follows by taking the maximum of the two bounds. □ 

Putting these lemmas together, we obtain Lemma 5.14. To simplify the notation, we shall 
write U°{m) to denote the multi-set of open edges reached via a cr-walk from e, as well as 
for the size of this multi-set. Note that, in this notation, if \a\ = then U°(m) = {e}. 

Proof of Lemma 5.14- Fix cu ■ n 3 ^ 2 < m ^ m*, an open edge e G 0(G m ) and a sequence 
a E {L,R}*, and assume that £{m) D Z(m) holds. The lemma holds if V° = Y e , as noted 
above, so let us assume that \a\ > 0. We begin by bounding 

A(y:{m)-U° e {m))\ = a( £ Y f (m)\ < 2 • Y(m) ■ \ AU:(m) \ + £ |Ar>(m)|, 

where the inequality holds since Yf(m) ^ 2 • Y(m), which follows from the event £{m). We 
claim that 

2-Y(m) ■ \AU°{m)\ + \AY f (m)\ ^ 3 ■ \a\ ■ (log n) 3 -Y(m)^. 

feu*{m) 

Indeed, this follows by Lemma 5.17, and the facts that U£(m) ^ Y(m)^ (since £(m) holds) 
and that |AY}(m)| ^ 2 • (logn) 2 (since Z(m) holds), as above. 
Now, by the product rule, we have 

A(v:{m).U:{m)) = C/f (m)A(y/(m)) + WARm)) + A(V7(m)) A(U°{m)). 



63 Recall from Definition 5.3 that if e 0(G m+1 ), then U°(m + 1) = U°(m). 

64 Note that such walks, of length r, are in bijection with the walks from e to / of length r + 1. 
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Moreover, we have V°(m) ^ 2Y(m) (since £(m) holds), and since Y(m) > n £ 3> \a\ ■ (logn) 3 
for every m ^ m*, it follows from Lemma 5.17 that \AU^(m)\ <C U°(m). Hence, by the 
triangle inequality and Lemma 5.17 (again), we have 

\AV°(m)\ < ^ (Vl • (logn) 3 -y(m) H + F(m) • |A£/ e CT (m)|) 

^ z>7R( |(7| ' (logn)3 ' :p(m)H ) ^ (logn)4 ' 

as required. □ 

5.4. Self-correction. In this subsection we shall prove, using the results above, that the 
variables V" exhibit a kind of self-correction. The calculation is somewhat lengthy, and 
requires careful counting of the u-paths created and destroyed in a typical step of the triangle- 
free process. For each m G [m*], each /c-short sequence a G {L,R}* and each open edge 
e G 0(G m ) ) define 

{KT[m) = 

gAt) ■ Y(m) 

Recall that V(m') denotes the event that the variables V°(m) are tracking up to step ml, 
i.e., that |(V e <T )*(m)| < 1 for every u ■ n 3//2 < m < m! ', every e G 0(G m ) and every /c-short 
sequence a. 

Recall that = |~3/e] and n ^ no(e, C, cu). The aim of this subsection is to prove the 
following key lemma, which implies that the variables V° are self-correcting. 

Lemma 5.18. Let to ■ n 3/2 < m < m* . If £(m) fl U (m) fl V(m) n A'(to) fl Z(m) n Q(m) 
holds, then 

E[A(V:r(m)} G (8±5)~(-(17)V)±0( e )) (94) 

/or every e G 0(G m ) and every k-short sequence a G {L, i?}*. 

Our main task will be to prove the following bounds on E [AV7(m)] . We shall then deduce 
Lemma 5.18 using the mixing results from Sections 5.1 and 5.2. 

Lemma 5.19. Let u ■ n 3 / 2 < m < m* . If £{m) flW(m) flAf(m) fl Z(m) holds, then 

E[AV7(m)] e - ' pW>) + WKr>A + ( 1±Cfc(t) )*M 

for every e G 0(G m ) 7 and every k-short sequence a G {L,R}*. 

We begin by controlling E[AC/^(m)] . For each a G {L, R}*, define o~(j) to be the sequence 
formed by the first j elements of a. 65 Lemma 5.19 is a straightforward consequence of the 
following bounds. 



65. 



Thus if cr = (<7 l5 ... , a e ), then cr(j) = (<j u . . . , <jj). 
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Lemma 5.20. Let oj ■ n 3 / 2 < m < m* . If £{m) (lU(m) n X{m) D Z{m) holds, then 

1 



75 



M 



E[AC/ e CT (m)] G 



g(m; 



Y,Ut{m)V^\m) + |<r| 



1 , A X(m)F(m) |a| - 1 



for every e G 0{G m ), and every a G {L, R}* with \a\ ^ oj. 

In order to prove Lemma 5.20, we shall first need to prove a series of simpler lemmas. We 
begin with two easy observations. 

Observation 5.21. Let m < m* and e G 0(G m ). Then 

p(/ 6 nV+D\nV)|G„) = ( ji 5 , 

for each f G X^{m), and is zero otherwise. Moreover, \Y^{m + 1) \ F e L (m)| ^ 1. 

Proof. Let ft be the edge selected in step m + 1, and suppose that / G y e L (m + 1) \ y e L (m). 
Then / G X^(m) and ft, G X^{m) form an open triangle with e in G m , and moreover 
F e L (m + 1) \ Y e L (m) = {/}, as required. □ 

Recall that if e G" 0(G m+ i) then (by convention) we set Y^{m + 1) = Y^{m). 

Observation 5.22. Let m < m* and e G 0(G m ). Then 

Y f (m) - 1 



P 



(/GY;V) \ Y e L (m + l)\G m ) 



Q{m) 



for each f G Y^{m), and is zero otherwise. 

Proof. Let h be the edge selected in step m + 1, and suppose that / G F e L (m) \ F e L (m + 1). 
Then either e or / was closed by h. But if e ^ 0(G m+ i) then Y" e L (m+l) = Y^{m), so we must 
have ft G Yf(m) \ {e}. Any such ft will suffice, and hence we have / G Y^(m) \ Y e L (m + 1) 
with probability exactly (Yf(m) — l)/Q(m), as claimed. □ 

The observations above imply the following two identities. 

Lemma 5.23. Let m ^ m* , e G 0(G m ) and a G {L, R}* . Then 

1 



E 



/£^(m+l)\yi(m) 



C//(m) 



Q(m) 



E U f( m )> 



f£X£(m) 



and 



E 



E r J 

L /eyi(m)\yi(m+i) 



Q(m) 



E {Y f (m)-i)UJ(m). 



(95) 



(96) 



feY e L(m) 

Proof. The first identity follows immediately from Observation 5.21, since the sum on the 
left is equal to UJ(m) with probability 1/Q(m) for each / G Xf(m), and zero otherwise. 
The second identity follows immediately from Observation 5.22, since the sum on the left 
contains / with probability (Yf(m) — 1)/Q(m) for each / G Y^im). □ 

Combining Observation 5.21 with Lemma 5.17, we also obtain the following bounds. 
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Lemma 5.24. For every u ■ n 3 / 2 < m ^ m* , if £(m) n Z(m) Zioids, then 

X{m)Y{m)\ a \- 1 



E 



E A ^ 

feY e L (m+l) 



m) 



G E[Al/;(m)] ± (log 



n) 



Q{m) 



for every e G 0(G m ) 7 and every cr G {L, R}* with \a\ ^ ui. 
Proof. Observe first that, by linearity of expectation, 



E 



E Af/ /( m ) 

f&L(m+l) 



Gr 



E Af/ /( m ) 

/ e y/-(m+l)\Y'/(m) 



- E[AC//(m)] = E 

f€Y e L (m) 

since if / G F e L (m) \ Y" e L (m + 1) then / £ 0(G m+ i), and thus AUJ(m) = 0. 
Now, since £(m) D iT(m) holds, by Lemma 5.17 we have 

\AUJ(m)\ < • (logn) 3 -F(m) l<j| - 1 . 

Moreover, by Observation 5.21, we have / G Y e L (m + 1) \ F e L (m) with probability 1/Q{m) 
for each / G (m). Hence, using the event £(m) to bound X e (m), we have 



E 



E Af W 

L /ey e i (m+i)\y e L (m) 



G 



e±( io gn r.f W i^.|| 



as required. □ 

We can now prove our first bounds on E[AC/^(m)] . For each a G {L, R}*, let us write a~ 
for the sequence obtained by removing the first entry of a. m 

Lemma 5.25. Let u ■ n 3 / 2 < m < m* . If £{m) nW(m) flAf(m) fl Z(m) /io/ds 7 t/ien 



E[AC/f(m)] G ^ (^E[A^(m)] - 



Y}(m)f//-(m) 
Q(m) 



X(m)F(m) 
Q(m) 



o-l-l 



/eyf 1 (m) 

/or every e G 0(G m ) 7 and every <r G {L, i?}* wi£/i < |er| ^ uj. 
Proof Since 

AC/f(m) = U:(m + 1) - C/ e CT (m) = ^ [//"(m + 1) - £ [//"(m), 



it follows that 
E[Af/ e CT (m)] = E 



/eY e CT1 (m+i) 

E E u f 

feYp(m+l) /eY e CTl (m) 



/ey e CT1 (m) 



m 



Gr 



E[AUf(m)]+E U f( m ) ~ E U f( m ) 

feY e ai (m+l) L /ey e CT1 (m+l) feY e ai (m) 



Gr 
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'Thus if a = (a\, . . . , at), then a = (<t 2 , • ■ • , ere)- 
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Now, by Lemma 5.23, and using the event U{m) fl X(m), we have 
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E 



feYP(m+l)\YP(m) 



E 



since | cr | = |cr| — 1 and X^{m) = X^{m). Similarly, we have 



2H • Q(m) ' 



E £ C/;-(m) 

L /ey ( T 1 ( m )\Y e <T1 (m+i) 

and by Lemma 5.24, 



^ ^ y / (m)t//"(m) ± f(m)H 



Q(m) "' Q(to) 



E 



£ AUf(m)- £ AC/ /"( m ) 

/ey e CTl (m+i) /e^H 



a 



G ±(logn)- 



X(m)f(m)H- 2 



Combining the last four displayed equations, and noting that g x (t)X(m) ^> Y(m) ^> (logn) 4 , 
we obtain 



E[AU:(m)} G ^ (E[AC/;-(m)] - 



Y f (m)Uf (to) 



J_ A X(to)F(to)H-i 

2H ±fcWJ g(^) 



□ 



as required. 

In order to prove Lemma 5.20, we shall need the following bound on the covariance of 
Uf(m) and Vf (j \m) over the set Y e ai (m). 

Lemma 5.26. Let to ■ n 3//2 < to ^ to* . IfU(m) holds, then 

Uf (m)Vf U \m) G U°{m)V^ j+1 \m) ± e ■ g x (t) ■ X{m)Y{m)\ a \- 1 (97) 

feYp(m) 

for every e G 0(G m ) and every a G {L, R}* with ^ j < \a\ ^ u>. 

In order to prove Lemma 5.26, we shall use the following corollary of Observation 5.9. 
Observation 5.27. Let cu ■ n 3 / 2 < to < m* . IfU(m) holds, then 



Y e ai (m) 



/ey e 01 (m) 



for every e G 0(G m ) and every a G {L, R}* with < \a\ ^ u. 

Proof. Set Of = (to) and bf = UJ (to) for each / G Y^ 1 (m). By the event U(m), we 
have 

H-i 

a f G (1 ± ^(t))F(m) and 6/ G (l ± &,(*))' 
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and so, by Observation 5.9, 

l^feYP( m) U f K m ) Y - W /ey/1(m) 

as required. □ 

We can now easily deduce Lemma 5.26 from Observations 5.9 and 5.27. 

Proof of Lemma 5.26. Set a f = Vf {j \m) and b f = Uf(m) for each / G Y e ai (m). Noting 
that, 

fY(m)\^~ 1 

a f e(l±g x (t))Y(m) and b f G (l ± g x (t)) u I J , 

which both follow from the event U(m), we obtain 
Yp{m) J2 Uf(m)Vf {j \m)e(l±u 2 g x (t) 2 ) £ Uf(m) £ V^V), 

by Observation 5.9. Applying Observation 5.27 to the sequence a(j + 1), it follows that 
Uf(m)Vf {j) (m) G U a e (m)V: {j+l \m) ± ou 2 g x (t) 2 Y(m)^ +1 , 

which implies (97), since X(m) 3> to 2 ■ g x (t) ■ Y(m) 2 for every m ^ m*. □ 
We are now ready to prove the claimed bounds on E[AC/f (to)] . 

Proof of Lemma 5.20. Let tu-n 3 / 2 < to ^ to*, and assume that £(m)nU(m)nX(m)r\Z(m) 
holds. We shall prove that 

E [ AC/ "<™>] e "OR | V) + H ± *(«>) * (m ff W "' . (^) 

for every e G 0(G m ) and every a G {L, i?}* with \a\ ^ w, by induction on I = \a\. The result 
is trivial if £ = 0, since in that case U°(m) = 1 for every e G 0(G m ) and every to ^ to*. For 
£ = I, assume for simplicity that a = L, and observe that 

f€Y e L (m) 

The bounds in (98) now follow, since U^(m)V e L (m) = X)/gy^(m) -^7( m )> by Definition 5.3, 
and ZY(to) fl A" (to) implies that Y e L (m) +X^(m) G (| ±<7 x (t))X(m), since X^(m) — Xf{m). 

So let £ ^ 2 and assume that (98) holds for every open edge in G m and every sequence of 
length £ — 1. Let e G 0(G m ), and let <7 G {L, By Lemma 5.25, we have 

Y f (m)Uf(m)\ fl i \ X(m)y(m)M-i 



e[a W]6 £ ( E [At/;»] - ^"ggj (m) ) + ± 



Q(to) 
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k-l 



E[AUf(m)} G --l-r ^ Uf{m)Vp\m) + (|a|-l) ( ^ ± <fe(f) 



Q(m) 

for each / G Y^irn). Combining these bounds, we obtain 67 

H-i 

Q(m) ^ 



2H- 



X(m)f(m)H-2 



E 



[ A w] e -^E E u f( m ) v f 

XMFMH-i / i 



+ 



21-1 



F e CT1 (to) 
ffm) 



Q(m) 

Now, by Lemma 5.26, we have 

H-i _ kl 

E E ^N^ ' e ^C/f(m)K CTW (m) ± e. |<r| .g x (t)X(m)Y(mp-\ 

j=0 feYp(m) 3=1 

and by the event U(m), we have Yp(m) G (l ± g x (t))Y(m)/2. Since £ ^ 2, we obtain 68 



E[AC/ e CT (m)] G 



kl 



Q(m; 



Q(to) 



as required. 

We note the following easy identity, which is proved in the Appendix. 
Lemma 5.28. 



□ 



E 



A(m) 
B(m) 



E[AA(m 
B(m) 



1 



B(m) 



-E 



A(m+ l)AB(m) 
B(m+ 1) 



Our bound on E[AV e °"(m)] now follows via a straightforward calculation. 

Proof of Lemma 5.19. Let er G be a /c-short sequence, let e G 0(G m ) for some 

w • n 3 / 2 < to < to*, and suppose that £(to) nU(m) D A'(m) n £(m) holds. Since 

£/ e CTL (m) + £/ e CTfi (m) 



v;tm) 



it follows, by Lemma 5.28, that 
1 



E[AV e a [ 



m 



E 



A(U° L (m) + U° R (m)) - E V e a (m+l)-AU°(m) \ G m .(100) 



Now, by Lemma 5.14, and since £ (to) fl Z{m) holds, we have 

V7(to + 1) G V7(to) ± (logn) 4 , 



67 



Note that the term Yf(m)UJ (m) is absorbed into the sum as the term j = 0. 
68 This follows since \ (l ± g x {t)) (2^ +1 ± g x (t)) € (2- l ±g x {t)). 
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and since U{m) holds, U°(m)Ve U \m) ^ 2 • Y(m)^. Moreover, note that g x (t)X(m) > 
(logn) 4 • Y(m). By Lemma 5.20, it follows that 

r i 1 H 

E V:(m + 1) ■ AU:(m) \ G m G --— £ U° e ' (m)V e ^ (m)V e ° \m) 

, ( 1 , x\ X(m)i > (m)l' 7 l , . 

+ H(a R ±W«)j • (WD 

Moreover, applying Lemma 5.20 to U° L (m) and U° R (m), we obtain 

1 H 

E[A(f/f(m) + !7fH)] G ___^(^( m ) + ^( ro ))^(i)( ro ) 

Now, combining this with (100) and (101), and recalling that U° L (m)+U° R (m) = U°(m)V°(m) 
and 2Hf/ e CT (m) G (l ± H y(m)H, it follows that 

as required. □ 

Combining Lemma 5.19 with mixing results from Sections 5.1 and 5.2, we can now prove 
that the variables (V°)*(m) are self-correcting. 

Proof of Lemma 5.18. Let cu ■ n 3 / 2 < m < m*, let er G {L, i?}* be a A;-short sequence, and 
suppose that the event £{m) nU(m) fl V(m) fl A"(m) fl Z(m) fl Q(m) holds. We claim first 
that, if exactly r G {0, 1, 2} of the sequences aL and aR are not fc-short, then 69 

w;(m) := gW%gMgW 6 (l ±jfc+l(t ))y(n.)V7(m) 

t/ e (m) 

+ (1 ± • \ ■ 9*(t)Y(m) 2 ■ (V:Y(m). (102) 

for every e G 0{G m ). 

To prove (102), suppose first that the sequences aL and aR are both fc-short, i.e., r = 0. 
Then, since the event V(m) holds, we have 

U° L {m)V° L {m) + U: R (m)V° R (m) , . ~ U^jm) + U° R (m) 

— G {l±g a+1 (t))Y{m) — 

= (l±g a+1 (t))Y(m)V:(m), 



69 



Here, and throughout this proof, we write g a+ \(f) to denote the function g a L{t) = 9aR{t)- 



THE TRIANGLE-FREE PROCESS AND R(3,fc) 81 

as required. Next, suppose that r — 1, i.e., that crL is not A;-short (say), but <jR is /c-short. 
Then, by Lemmas 5.6 and 5.12, 

\v:(m) - V: L (m)\ = o(g y (t)Y(m)) < 9a+1 (t)Y(m), (103) 

and hence, since the event U(m) D V(m) holds, 

U aL (m) 

= (l±g a+1 (t))Y(m)V:(m) + _^±-i(l^( m ) - y( m )) 

fl ± a (f^yOn) 

C (1 ± W*))y(m)Kf(m) + A 2 • S^)ra>W. 

as claimed. Finally, if r = 2, i.e., neither aL nor aR is fc-short, then (103) holds for V° L (m), 
and also for V^ R (m), by Lemmas 5.6 and 5.12, and so 

w;(m) 6 ^('")^'-);f('")^( m ) ±gj+i(f)y(m)v . /(m) 

= (1 ± , CT+1 (t))F(m)V; CT (m) + ^ L(m J^ (m) • (W " Y(m)) 

C (1 ±^ +1 (t))F(m)V; CT (m) + (l ± ^(t))F(m) • ^(t)(V/)*(m)y(m), 

which proves (102). 

Next, combining Lemma 5.19 with (102), we shall prove that 

^Pfg^^) e M*<'))(. + *)-^(-wr(») ±0 (.)) <-) 

for every e G 0(G m ). 

To obtain (104), recall first that, by Lemma 5.19 and the event Q(m), we have 

Next, note that, using the event Q(m), the final term in (102) can be simplified as follows: 

e (1 ± 2 9x(t) ) • « • (V7)*(m). 
gAt)Y(m)Q(m) X " n 3 ' 2 

Moreover, observe that 

X(m) _ 1 



Y(m)Q(m) t ■ rfil 2 ' 
Thus, using (102), it follows that the left-hand side of (104) is contained in 
1 / {l±g a+1 (t))V°(m) St ,CgM\ , , Art 

gW){ QH + ^ ± ^J-^ 1±2 ^-^- (V;) (m) - (1 ° 6) 



82 GONZALO FIZ PONTIVEROS, SIMON GRIFFITHS, AND ROBERT MORRIS 

Now, observe that 

1 fV:(m) 8t\_ 8t {V°(m)-Y(m)\ _ 8t 



9a{t)\Q{m) nVy n 3 / 2 V g a (t)Y(m) J n 3/2 
It follows that (106) is contained in 

8t ra . w± ?f^_( 1±29l ( ( )).ifl. WTW± - 



n 3/2 ve/v / Q(rn) K ' n*/ 2 Ve/V ' t ■ n 3 / 2 

C (1 ± g x (t)) (8 + 4r) — ( - (K <7 )*( m ) ± 0(e)) 

as required, since g a+ i(t) — e ■ g a (t), t ^ ui and g x {t) <^ ui ■ g c {t). 

Using (104), we can now easily prove (94). Indeed, by Lemma 5.28 we have 

E\A(V°Y( )] - E l- AV ^ m ^ ~ Af » ~ + 1) ■ A(g a (t)Y(m)) \ G m ] 

and differentiating gives 

I /8t 2 — 1\ ~ 1 /4t 2 — 1\ 

Ay(m)»-^l —^\Y{m) and A(g a (t)Y(m)) ^ - ^i-—^) g<T (t)Y(m). 

Moreover, by Lemma 5.14, together with Lemma 4.23, we have 

^^•W^)-* (107) 

Combining these bounds, it follows that 

, 9 ^(« + + «™»> ± .a» ■ M~ 

and hence, by (104), 

E[A(KTN] G (4 + 4r±e)~(-(l7r(m)±0( e )), 
as required. □ 

5.5. The Lines of Peril and Death. In order to apply the method of Section 3 to the 
variables V°, we need one more lemma, which bounds the maximum and expected absolute 
single-step changes in (V°)*. 

Lemma 5.29. Let u ■ n 3/2 < m < m* . If £(m) nU(m) H X(m) H Z(m) H Q(m) holds, then 
\A(V:T(m)\ < C '^ Y and E[|A(W(m)l]< ' ^ " 



^(t)F(m) ' u v e / v y.j - ^ . n 3/2 

/or every e G 0(G m ) and every k-short sequence a G {L, R}* . 
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Lemma 5.29 is an easy consequence of the following simple observation, combined with 
the results above. 

Lemma 5.30. Let u ■ n 3/2 < m < m* . If £(m) C\U(m) n X(m) fl Z(m) n Q(m) holds, then 

Q{m) 

for every e G 0(G m ) and every k-short sequence a G {L, R}* . 

Proof If \a\ = then the claimed bound holds trivially, so assume that |er| ^ 1. To prove 
the lemma, observe that a cr-walk W is destroyed in step m + 1 of the triangle-free process 
only if the edge /i added in that step is a ^-neighbour of an edge of W. Moreover, since the 
event £(m) r\U(m) fl X{m) D Z(m) holds, and since Y(m) 2 3> X{m) for t ^ w, it follows 
from Lemma 5.20 that E[AC/^(m)] < 0, and hence that the expected number of cr-walks 
from e created in step m + 1 is fewer than the expected number destroyed. Thus 

by the event ti{m) fl Q(m), as required. □ 

Proof of Lemma 5.29. The first inequality follows easily from Lemma 5.14, using Lemma 4.23. 
Indeed, since £{m) fl Z{m) holds, we have 

\ HK y (m) \ < 2 /jAVTWJ + logn\ C(logn)« 
1 »<>(«) V Y(m) n 3 *) g.(t)Y(m) 

by Lemma 5.14, and since a is fc-short, as claimed. 

To bound E[|A(V e °")*(m)|] , observe that (cf. Lemma 5.28), for any positive functions A 
and B, 



A 



A(m) 
B{m) 



\AA(m)\ A{m+l)\AB{m)\ 



B(m) B(m) ■ B(m + 1) ' 
Applying this to U°(m)Vf(m) = U° L {m) + U° R (m), we obtain 

< |A^(m)| + |A^(m)| + V" {m + 1) ■ |A^(m)| 

Combining this bound with Lemma 5.30, and using the event U(m), we obtain 

C 2 ■ Y(m) 2 



E AK ff m 



e 



Hence, by (108), we have 




E[|A(HT(m)|] < 4r.|^=+5| < r °" S " 



9,(t) \ Y(m) j ~~ <)„(«)■ n 3 / 2 ' 

as required. □ 
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We are now ready to prove Proposition 5.4. 

Proof of Proposition 5.4- We shall bound the probabilities of the event 

U(m- 1) n V{m) c r)K, y (m- 1) (109) 

for each u> ■ n 3 / 2 < m ^ m*. In order to do so, we shall apply our usual martingale method 
to the variables V" for each edge e and each /c-short sequence a G {L, R}*. 

As usual, we begin by choosing a family of parameters as in Definition 3.4. Set /C(m) = 
€(m) n U{m) n V(m) n X(m) H y(a) n -2(ra) n Q(m) and / = [a, b] = [u ■ n 3/2 , m*], let 

a(t) = C - {X ° gnf and 0(f)- C ' l0gn 



^(t)F(m) • " W g ff (t)-nW 

and set A = C and 5 = e and fo(f) = t ■ n~ 3 / 2 . We claim that (A, 5; g a , h; a, f3; JC) is a 
reasonable collection, and that V e CT satisfy the conditions of Lemma 3.5. 

The first statement follows easily, since a and (3 are clearly A-slow, and the bounds 
min {«(£), /3(t), h(t)} ^ 4^ and a(i) ^ e 2 follow from the fact that n~ 1/4 ^ g a (t) ^ 1 
for every t ^ £*. To prove the second, we need to show that the variables V° are (g a , h; K)- 
self-correcting, which follows from Lemma 5.18, and that, for every to ■ n 3//2 < m ^ m*, if 
K{m) holds then 

|A(KT(m)| < <*(t) and E[|A(KT Ml] < 

which follows from Lemma 5.29. Moreover, the bound |(V e °")*(a)| < 1/2 follows from the 
event y(a), since f y (uj)Y(n 3 / 2 ) <C (7j,(u;)Y(a) if u)(n) — >■ oo sufficiently slowly. 
Finally, observe that 

a(t)/3(t)n 3 / 2 < C ' (bgn)4 ■ ^f 1 < 1 



g a (t)Y(m) g*(t) ; (logn) 3 

for every u> < t ^ t*, since g CT (t) 2 F(m) ^ e 3fc2 • u ■ (logn) 8 . By Lemma 3.5, it follows that 

P(V(m) c n/C(m- 1) for some m G [a,6]) ^ n 7 exp ( - 5'(logn) 3 ) < n- 2Clogn , 

where we summed over edges e G E(K n ) and /c-short sequences <r G {L, i?}* the probability 
that e G 0{G rn ) and (V e cr )*(m) > 1. This gives us a bound on the probability that one of 
the events in (109) occurs for the first time at step m! of the triangle-free process; summing 
over choices of m' ^ m, this gives a bound on the probability of the event 

U{m - 1) n V{m) c n IC y (m - 1). 

Using Proposition 5.5 to bound the probability of the event U (m — l) c (l)C y (m — 1) , we obtain 
the claimed bound on the probability of the event V(m) c D JC y (m — 1), as required. □ 

Finally, note that Proposition 5.1 follows immediately from Proposition 5.4. 
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6. Whirlpools and Lyapunov functions 

In this section we shall prove the following theorem, which together with the results of 
the previous three sections, implies Theorems 2.1 and 2.4 and hence the lower bound in 
Theorem 1.1. 

Theorem 6.1. For every to ■ n 3//2 < m ^ m* , with probability at least 1 — n~ 2logn either 
(A"(m — 1) fl y(m — 1)) or the following holds: 

(a)%p\ei±g q (t), (b) §^ei± 9q (t), (c)l^ei± 9q (t). (no) 

Q(m) X(m) Y(m) 

The proof of Theorem 6.1 is roughly as follows. We shall first show how the normalized 
errors of X, Y and Q depend on one another; in particular, we shall show that X is self- 
correcting (given bounds on Y and Q), whereas Y and Q have a more complicated two- 
dimensional interaction which resembles a whirlpool, see Figure 6.1. 

Since the eigenvalues of this interaction turn out to be negative (in fact, both are equal 
to —1), we will be able to define a Lyapunov function, A(m), which is self- correcting and is 
bounded if and only if the normalized errors of both Y and Q are bounded. The required 
bounds on X, Y and Q then follow easily by our usual Line of Peril / Line of Death argument. 

6.1. Whirlpools. The first step in the proof of Theorem 6.1 is the following lemma, which 
shows how the normalized errors of X(m), Y(m) and Q(m) depend on one another. Set 

T(m) = *(™)-*(™) , T{m) = Y (m)-Y(m) and { ) = Q(m) -Q(m) 
g q (t)X(m) g q {t)Y{m) V 9 q (t)Q(m) 

for each m G N. 

Lemma 6.2. Let uj ■ n 3/2 < m < m*. If X{m) fl y(m) n Q(m) holds, then 

(a) E[AQ*(m)] = ^ ( - 2F*(m) + Q*(m)) . 

(b) E[AY*(m)] e (l±e)-J^(-3F*(m) + 2Q*(m)±e). 

(c) E[AX*(m)] E (l±e)-^^-T(m)-4Y*(m)+4Q*(m)±ey 

We shall only sketch the proof of Lemma 6.2 here, and defer the (lengthy, but fairly 
straightforward) details to the Appendix. We first recall the following lemmas, which were 
already stated in Section 2. 

Lemma 6.3. For every m£N, 

E[AQ(m)] = -F(m) - 1. 
Lemma 6.4. Let u ■ n 3 / 2 < m < m*. If X(m) fl y{m) fl Q(m) holds, then 

E[AY(m)] e —^—(-Y(m) 2 + X(m) -2- Var(F e (m)) ±0(Y(m))). 
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Lemma 6.5. Let uj ■ n 3 ^ 2 < m < m* . If X(m) fl y{m) fl Q{m) holds, then 



E 



[AI(m)] G ^^^-2-X(m)r(m)-3-Cov(X,F)±0(^X(m) + f(m) 2 ^. 



Lemma 6.3 is trivial, since if edge e is chosen in step m + 1, then AQ(m) = —Y e (m) — 1. 
In order to prove Lemmas 6.4 and 6.5, we use the variables 

Y(m) = Y e( m ) and X ( m ) = Yl X e( m )> 

eeQ(m) eeQ(m) 

which are exactly twice the number of edges in the F-graph, and six times the number of 
open triangles in G m , respectively. Using the fact that the F-graph is triangle-free (which 
follows since G rn is triangle- free), it is not hard to show that if edge e is added at step m+1, 
then 

AY(m) = X e (m) - 2 Y f( m )i ( m ) 

feY e (m) 

and moreover 

AX(m) € -3 X /( m ) ± 0(x(m) + F(m) 2 ) (112) 

feY e (m) 

assuming X(m) fl y{m) fl Q(m) holds. Summing over open edges e G Q(m) gives 
and 

E[AX(m)] G ~~qi — \ X /( m ) ' y /( m ) ± 0(X(m) + F(m) 2 ). 

Finally, a straightforward calculation shows that if X(m) fl y(m) fl Q(m), then 

E[AY(m)] F(m) 2 ±Q(F(m)) 

E[AF(m)] + — qh — 

and 

r —, xl E[AX(m)l A(m)F(m) ±0(A(m) +y(m)) 

Lemmas 6.4 and 6.5 now follow easily, see the Appendix for details. 

To deduce Lemma 6.2, we simply use Lemma 5.28 to differentiate X , Y and Q*, and 
observe that X(m) <C Y(m) 2 , since t ^ u, and that 

Var(y(m)) < g q {t)Y{mf and Cov(A(m), Y(m)) < ^(t)X(m)F(m), 

since A'(m) and iV(m) hold. The lemma now follows via a straightforward calculation. 
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(113) 



Figure 6.1. The whirlpools of (Y*,Q*) and (X,ji). 

6.2. Lyapunov functions. Note that, by Lemma 6.2, although X* and Y* are self-correcting, 
Q* is not. We shall therefore have to define a slightly more complicated martingale than 
in the previous sections. Fortunately, the classical work of Lyapunov [39] provides us with 
exactly the function of Y and Q* which we need. Indeed, let us rewrite the vector (Y , Q*) 
using the change of basis 

Q* ) = " ( 4 3 ) ( n 
and define a new parameter 

A(m) = A(m) 2 + /i(m) 2 . 
The following result will imply the bounds on Y(m) and Q(m) in Theorem 6.1. 

Proposition 6.6. For each to ■ n 3 / 2 < m ^ m* , set Kirn) = X(m) PI [V(m) D Q(m). Then 

P((A(m) > 1) H /C(m - 1) for some w • n 3/2 < m < m* \ ^ n~ clogn . 

We will prove Proposition 6.6 using the our usual martingale method; as always, we shall 
need to bound the maximum possible and expected single-step changes. 

Lemma 6.7. Let to ■ n 3 ^ 2 < m ^ m* , and suppose that X(m) D y(m) D Q(m) holds. Then 

\AX*(m)\ + \AY\m)\ + |AQ*(m)l < g ^y^ ( 114 ) 

and aence 

' AA WI<J^ md E [|AA(-)I]< (115) 
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Once again, we only sketch the proof, and postpone the (easy) details to the Appendix. 

Sketch of proof. The inequality (114) follows easily from (111) and (112), together with 
bounds corresponding to (107) for X, Y and Q, via a straightforward calculation (see the 
Appendix). To deduce (115), simply observe that 



i<M)| 2 , 



|AA(to)| < 2(jA(m) • AA(m)| + \fi(m) • A//(to)|) + |AA(m)| 2 + \A/i(i 
and use (114) and the event Q(m) to bound the various terms. □ 

We also need to show that A(m) is self-correcting. 
Lemma 6.8. If X(m) n y(m) fl Q(m) holds, then 

E[AA(to)] < J^(-A(to) + 0( £ )). (116) 

Proof. Note first that 

1 / -3Y* + 2Q* \ f -3 2 \ / 4 5 \ / A \ / -4A - 9fi 
e V -2T + Q* J ~ V "2 lA 4 3 / W V -4A-7/X 

and hence, by (113) and Lemma 6.2, 

4 -E[AA(to)] +5-E[A//(m)] = ^-E[AF*(m)] G (1 ± e) • ( - 4A(to) - 9//(m) ± e) 

4 • E [AA(m)] + 3 ■ E [A// (to)] = --E[AQ*(m)] = ~ 4A ( m ) ~ 7^(m)). 

Note that |A| + = 0(1), since the event Q(m) holds. It follows that 

„3/2 ^3/2 

— -E[AA(to)] G -A(ra) -/i(m) ±0(e) and — • E[A//(to)] G -//(to) ± O(e), 
and hence 

^ ■ ( E ^ A(m) ] ^ ■ ^ X ^ * < -2A(to) 2 -2A(to)M-)-2M-) 2 + 0(,) 



2t V E[A/i(m)] y V 

= -(A(m) 2 + /i(m) 2 + (A(m) + /i(m)) 2 ) + 0(e) ^ -A(m) + 0(e). 

Noting that 

AA(m) = (AA(m)) 2 + (Ayu(m)) 2 + 2^A(m) • AA(to) + //(to) • A//(to)^, 
and using (114) to bound AA(to) and A/t(m), the bound (116) follows. □ 

We can now prove Proposition 6.6, using the method of Section 3. To be precise, we shall 
show that A has all of the properties required of A* in the statement of Lemma 3.5, and 
deduce that therefore the conclusion of the lemma holds with A* = A. 
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Proof of Proposition 6.6. Let /C(to) = X(m) fl y(m) fl Q(m) for each m G [to*], set / = 
[a, b] = [u ■ n 3 / 2 , to*] and 

«*> - W) - 

and observe that a and /3 are C-slow and satisfy ^§72 ^ a(t) = (3(t) ^ e 2 . By Lemma 6.8, if 
JC(m) holds and A(to) ^ 1/2 then 

E[AA(to)] < 

which (since A(to) ^ 0) is exactly the required self-correction condition. Moreover, by 
Lemma 6.7, we have 

\AA{m)\^a{t) and E[|AA(to)|] < p(t) 
for every to G [a, 6] for which /C(to) holds. It now simply remains to observe that 

-!)(£)■ 

and that if /C(o) holds then \Y*(a)\ + |Q*(a)| < e 2 , since f y (u)Y(n 3 ^) < ^(w)y(a), and 
similarly f y (u)Q(n 3 ^ 2 ) <C g q (u)Q(a). Hence, 

A(a) = A(a) 2 + /i(a) 2 < ^ • (|F* (o)| + |Q*(a)|) 2 < 1 
as required. Finally, observe that 

/ n n i/9 (logn) 8 1 

for every uj < t ^t*. By Lemma 3.5, it follows that 

p((A(to) > 1) n /C(to - 1) for some to G [a, 6]) < n 4 exp ( - 5'n) < n" clogn , 

as required. □ 

We are finally ready to prove Theorem 6.1. 

Proof of Theorem 6.1. The bounds on Y(m) and Q(m) follow easily from Proposition 6.6, 
since if A(to) < 1 then |F*(to)| + \Q*(m)\ < 20e. Set 

K(m) = X(m) n y(m) n Q(to) n (|F*(to)| + |Q*(to)| ^ 20e) 
for each to G [to*], and let / = [a, 6] = [cu ■ n 3 / 2 , to*]. We claim that 

p(jX*(m) > 1) n /C(to - 1) for some to G [a, 6]) < n- closn . 
As usual, we shall apply the method of Section 3. Indeed, set h(t) = t ■ n~ 3 l 2 and 
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and observe that a and (5 are C-slow and satisfy ^§72 ^ a {t) — f3(t) ^ s 2 . Observe that X 
is (g q , h] /C)-self-correcting on [a,b] since if /C(m) holds then 

E[AX*(m)} G (l±e)~(-X*(m)±0(e)). 

by Lemma 6.2, and moreover 

\AX*(m)\ ^ a(t) and E[\AX*{m)\] < 

by Lemma 6.7. Note also that if /C(a) holds then |X*(a)| < 1/2, since /;j.(w)X(n 3 / 2 ) <C 
g q {ui)X{a) if w(n) — >■ 00 sufficiently slowly. Finally, observe that 

' 4 I 
g g (t) 2 • n 3 / 2 n 

for every u < t ^t*. By Lemma 3.5, it follows that 

p((Z*(m) > 1) n /C(m - 1) for some m G [a, 6]) ^ n 4 cxp ( - 5'n) ^ n - clogn , 
as claimed. Combining this bound with Proposition 6.6, the theorem follows. □ 

6.3. The proof of Theorems 2.1, 2.4, 2.5, 2.7 and 2.11. We end this section by deducing 
the main results of Section 2. 

Theorem 6.9. With high probability, the events £{m) , V{m), X{m), y{m), Z{m) and Q{m) 
all hold for every m ^ m* . Or, more precisely, 

P(£(m*) n V{m*) n X{m*) H y(m*) n Z(m*) H Q(m*)) > 1 - n~ logn (118) 

/or a// sufficiently large n G N. 

Proof. Consider the 70 first of the events to go astray, and suppose that it does so in step m 
of the triangle-free process. The probability of this event is controlled by: 

- Theorem 4.1 (if £(m) c holds), 

- Propositions 4.56 and 4.57 (if {X(m) fl y(m) fl <2(m)) c holds and t ^ u), 

- Proposition 3.8 (if X{m) c holds and t > u), 

- Propositions 5.1 and 5.4 (if (V(m) fl y{m)) c holds and t > oS), 

- Proposition 3.7 (if Z[m) c holds) and, finally, 

- Theorem 6.1 (if Q(m) c holds and t > cu). 

Summing the probabilities in these statements, we obtain (118), as required. □ 

Theorems 2.1, 2.4, 2.5, 2.7 and 2.11 all follow immediately from Theorem 6.9. 



70 Or, more precisely, a first event, since more than one might fail in the same step. 
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7. Independent sets and maximum degrees in G niA 

In this section we shall control the maximum degree and the independence number of the 
graph G ni A- Recall that e > is an arbitrary, sufficiently small constant, set 7 = 10y/e, and 
choose 5 = 5(e) > sufficiently small. 71 Moreover, let ni(e, 5, C,u) G N be sufficiently large. 

We shall prove the following propositions. Together with Theorem 6.9, they imply Theo- 
rem 2.12, and hence complete the proofs of Theorems 1.1 and 1.2. 

Proposition 7.1. If n ^ ni(e,6,C,u), then with probability at least 1 — e _v/ ™ ; either 

A(G n , A ) < (t! + 7 ) y/^ogn, 

or £(m) fl y(m) fl Z(m) fl Q(m) /aiZs to /io/d /or some m ^ m* . 

Proposition 7.2. If n ^ 0", C, a;), then with probability at least 1 — e _v/ ™ either 

a(G nA ) < (V2 + >y)\/n\ogn, 

or £(m) fl iV(m) fl Z(m) fl Q(m) fails to hold for some m ^ m* . 

Both propositions will follow by essentially the same argument, with a few key changes. 
We begin by outlining the main ideas of the proof. 



7.1. A sketch of the proof. The basic idea behind Proposition 7.2 is that a typical set of 
vertices S will contain roughly ('f )e~ 4 * 2 open edges at time t. If this were to hold for every 
set S, then the proof would be easy, since the probability of choosing an edge inside S would 
be (roughly) \S\ 2 /n 2 in each step, and thus the expected number of independent sets of size 
s in G m * would be roughly 




which tends to zero if s > (v^2 + ^x/n logn. Indeed, an easy application of our usual 
martingale method (see Lemma 7.19, below) will allow us to make this calculation rigorous 
for sets S which contain at most n s elements of each neighbourhood in G m *. On the other 
hand, for those sets S which intersect some neighbourhood in at least this many vertices, we 
shall have to do something quite different, see below. 

For Proposition 7.1 it is quite tricky even to come up with the right heuristic. One natural 
approach is to note that if the event £(m*) holds, then every vertex v in G m * has degree 
roughly 2m* /n, and open degree roughly ne -4 ^**^ ~ n l ^ 2+e . Since a set of size n 1 ^ 2+e is 
unlikely to contain an independent set of size ^y/n logn, and there are only n choices for v, 
we should be done. However, the events involved in this calculation are not independent, 
and we have not succeeded in making this argument rigorous. 



Since C — C(e) was arbitrary, we may assume that C — C(e, 5) is sufficiently large. 
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Instead we shall use the fact that, again by the event £(m*), every vertex v has degree 
roughly 2m /n in G m for every m ^ m*. It follows that, if S is the neighbourhood of a 
vertex v in G„ 5 a, then we would expect S to contain roughly (('f ) — ^r)e~ 4 * 2 open edges at 
time t, since roughly 2m 2 /n 2 edges of S will have been closed by v by this time. Moreover, 
we can approximate the probability that S C N(v) by summing over the sequence of steps 
at which the edges are added; the probability that a particular open edge e is added in step 
m is exactly 1/Q(m). Since we have about ( m *) 2m */«- choices for this sequence, we should 
obtain an upper bound on the probability that N(v) — S of roughly 

, ^•a./Tt 7 " 1 \n/, l s l 2 , 4m2 \ ( ">* fW- \S\'\\ 2m ' ,n 

(m > ( fi Qijm) n (i — + -j - (^exp (_ - _j j 

where we used Lemma 7.7 to estimate the first product. Summing over choices of v and S, 
we would thus obtain an upper bound on the probability that there exists a vertex of degree 
at least s = (2 + r y)m*/n in G n ,A of 



2m* /n 

- . m . g2(i*) 2 /3-s 2 /2n ] w n - 7 s 

S n ll/6 



as required, since e 2 ^**^ ps n x ^~ e and s 2 /2n (| + t) logn. 

Once again, the outline above can only be made rigorous if there are no other vertices 
which send many edges into S. In the next subsection we shall describe how we deal with 
the other cases. 



7.2. Partitioning the bad events. As the reader will have noticed from the discussion in 
the previous subsection, it is not true that the number of open edges in S is well-behaved for 
every set S of size 0(m*/n); indeed, those sets which happen to have a large intersection with 
the neighbourhood(s) of some (or many) vertices will have fewer open edges than expected. 
This motivates the following definition. 72 

Definition 7.3. Given a set S C V{G m ) and 5 > 0, define J = J(S, 5) = {v±, . . . , v\j\} and 
a = a.(S, 5) — (a±, . . . , a\j\) to be the following random variables: 

J(S,5) = [veV(G m ) : \N Gm .(v)nS\ >n 5 Y 

and dj = \Nq ,(vj) H S\, where the labels are chosen so that a± ^ . . . ^ a\j\. 

Given a set S C V(G m ) and m G N, let T(S, m) denote the event that S is an independent 
set in G m . It is easy to see that the following events form a cover of I(S, m*). 



We remark that if e(G n ^) < m* , i.e., if the triangle-free process ends before step m* , then we may 
define G m » = G„.a- Since in that case the event Q(m*) c holds, this choice does not affect the validity of 
either Proposition 7.1 or 7.2. 
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Definition 7.4. Given a set S C V(G n ^) and 5 > 0, define: 

(1) .4(5,5) = I(S,m*) n {o(G m [S]) > (1 - e)(lfl ) e - 4 * 2 for every m < m*}. 

(ii) B(S,5) = {Ej fl i < 5 \ S \ 2 } n {°( G m[S}) < (l-£)('f)e- 4 * 2 for some m ^ m*}. 
(Hi) C(S,S) = X( 1 S,m*)n{E J aI^ <5 l' S l 2 } n {EjS<^ 1/2+25 }- 
(iv) V(S,5) = l(S,m*) n { Ej«i > ™ 1/2+M }- 

We shall bound from above the probability that both £(m*) D y{m*) fl Z{m*) fl Q(to*) 
and the event 

[J .4(5,5) U#(5,<5) UC(S,5) UV(S,8) D \J J(5,to*) (119) 

SCV(G m ):\S\=s SCV(G m ):\S\=s 

hold, where s = (y/2 + 7) y/n log n. The easiest of the probabilities to bound is that of 
A(S,5) fl <2(m*) (see Lemma 7.6), which follows from a simple calculation, as outlined 
above. Bounding the probability of B(S,8) fl y(m*) fl Q(m*) fl X>(5, <5) c is also relatively 
straightforward (see Lemma 7.9): indeed, we simply apply our usual martingale method, 
using the fact that Ej° 2 < ^ fi2 to control the maximum possible size of a single step. 
Dealing with the events C(S,5) and \J\ S \ =S T>(S, 5) is significantly harder, and we postpone 
a discussion of the ideas involved to later in the section. 

Next, let us turn our attention to the event that G Ht /\ contains a vertex whose degree is 
significantly larger than 2m* /n. Given a set S C V{G m ) and a vertex v G V(G m ), observe 
first that if N(v) — 5 in G n ^ then 

(a) S is independent in G m *. 

(b) {u,v} G 0(G m *) U E(G m *) for every u G S. 

(c) N(v) C 5 in G m *. 

It follows that the event 1(5', to*) fl W(5, t>) must hold, where 

W(S,v) = {{u,v} G 0(G m *)UE(G m *) for every u G s} D {iV Gm » C 5}. (120) 
Note also that the event £(m*) implies that 

<fc»G— ±Vn (121) 

for every to ^ to*. Motivated by this, we define the following events, which are slight 
modifications of those above. 

Definition 7.5. Given a set S C 1 / (G , „ i a) and 5 > 0, define: 

(2) „4'(5,<5) = l(S,m*) n {o(G m [5]) >(1 - £)(( l f l ) - ^)e" 4i2 for every to < m*}. 

(«) B'(S,S) = { El J i 2 a 2 < *» 2 } n |J {o(G m [SJ) < (1 - £ )((lfl) - ^)e- 4 * 2 }. 

m=l 

(m) C'(S,*) = { Ei J i« 2 > 6s2 } n { £j«j < n^ +2S ). 
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Let T(S,v) denote the event that N Gn A (v) = S. We claim that 

[J T(S,v) n£(m*) C (J (a'(s,5) llB'(S,5) UC'(S,5) UV(S,S)j n W(S,v) (122) 

\S\>s \S\=s 

for every s ^ 2m* /n+ yfn. Indeed, we have already observed that T(S,v) implies W(S,v), 
and by (120) and (121), the event W(S,v) H S(m*) implies that W(S',v) holds for some 
S' C S with | S' | = s. The other implications now follow exactly as before, since the 
neighbourhood of a vertex in G n ,A is an independent set in G m *. 

We can now bound the probability that the various events hold for some S with |*S'j = s 
as before, the main differences being (as noted above) that bounding F(A'(S, 5) fl W(S, v) D 
£{rrf) fl Q(m*)) is slightly more technical than bounding F(A(S,5) fl Q(m*)), and that 
dealing with the event C'(S, 5) is relatively easy, since we shall be able to show that 

C'(S, 5) n W(S, v) C (^(m*) n 2(m*)) c , 

which implies that the bad event corresponding to C'(S,S) is in fact impossible. 

7.3. The events A(S,S) and .A' (5, 5). We begin with the easiest part of the proof, which 
requires only some straightforward counting. For each ^ m' ^ to*, let 0(S, to') denote the 
event that o(G m [S}) >(1 - £)( l f l )e" 4 ' 2 for every < to < m'. 

Lemma 7.6. If s ^ [y/2 + 7) v^logn, £/ien 

^ P^(S,i)nQ(m*)) < n-* s . 

S:|S|=s 

Proof. Recall first that ^.(S 1 , 5) = I(S,m*) fl 0(S,m*). The lemma is an easy consequence 
of the following observation: the probability that an open edge inside S is chosen in step 
to + 1 is exactly o(G f m [<S'])/Q(m). Hence 

F(l(S,m*)nO(S,m*)nQ(m*)) < max TT fl - °^ m ^^ . 
V V o(s, m *)nsK) 11 I Q(m) / 

m=0 x 

where the maximum is over all realizations of the triangle-free process for which both 
0(S,m*) and Q(to*) hold. Since 0(S,m*) n Q(to*) implies that 

°(GWfl) ^ (1 - 2 g )|^| 2 
Q{m) n 2 

for every ^ to ^ m*, and sm* > (>/2 + 7)(2^-e)ra 2 logn> (l/2 + 2e)n 2 logn, it follows 
that 

E wmw) < (:)-p(- ii ^ ,: ) < (t-s^)' « 

as claimed. □ 
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Next, let's make precise the calculation sketched earlier for A'(S,5). For each edge / G 
E(G m *), let m{f) G [m*] denote the step of the triangle-free process at which it was added, 
i.e., such that / G E(G m (j)) \ E(G m ^-i). We shall need the following simple lemma. 

Lemma 7.7. Let v G V(G m ) and let fi, ■ ■ ■ , fd be the edges of G m * which are incident to v. 
If£(m*) D Q(m*) holds, then 

Proof. Since £(m*) holds, we have d Gm (v) G ^ ± y/n for every < m ^ to*, and thus in 
particular 

2,171* 7 

d < + y/n = (2t* + l)y/n and t(fj) ^ + 1 

for every j G [d], where t(fj) = m(fj) -n~ 3 / 2 . Moreover, since £(m*) D Q(to*) holds, we have 
d » \fn and Q(m(fj)) > e~ 4 ^'n 2 /4 for each j G [d]. It follows that 

n-^flQWm ^exp(-4^(/,) 2 ) £^(-£(£ + ^= + 5)) 



n 2 



on J \ \6 J J \ 

as claimed. The final step holds since 8(t*) 2 ^ (1 — e) logn. □ 
For each < wl < to*, let 0'{S,m!) denote the event that the inequality 

o{G m [S\) > 

holds for every ^ to ^ to'. 

Lemma 7.8. If s ^ + 7) y/n\ogn and v G V(G m ), then 

w(a'(S,6) n W(S,v) n£(m*) n Q(to*)) < n~ 5s . (123) 

S:|S|=s 

Proo/. Recall first that ^'(5, 5) = X(S, to*) n £>'(S, m *)> and note that if Q'(S, to*) n Q(to*) 
holds then 

o(G m [S]) 



Q(m) 

for every ^ to ^ to*. Let /1, . . . , fd be the edges of G m » which are incident to i>, and note 
that N Gm , (v) C 5, since W(S', t>) holds, and that 
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since S(m*) holds. Given edges fi, . . . , f d and steps m(/i), . . . , m(fd) G [m*] such that 
da m (v) satisfies (121) for every m G [m*] , we shall bound the probability that the edge fj is 
chosen 73 in step m(fj) for each j G [d], and that at every other step, we do not choose an 
open edge in S. Having done so, it will suffice to sum over the at most 2 s (m*) d choices for 
the edges fj and steps m(fj). 

Note first that, by Lemma 7.7, and since the event £(m*) fl Q(m*) holds, the probability 
that the edge fj is chosen in step m(fj) for each j G [d] is 

ff n~ lld ' & . (125) 

Moreover, by (124), the probability that at every other step we do not choose an open edge 
in S is at most 

n (i - < «p( - (— - ). d26) 

where M = [m*\ \ {m(/i), . . . ,m(f d )}. Note that s 2 = d 2 + (s - d)(s + d), and that 

m*d fl \, (s + dW /2 + 7 \, , (m*) 3 rflogn 

> - - 3e logn, ^ / > — ^ logn and < 



n 2 " V 4 / ™ 2 V 4 / ™ 4 16 

for every d E D, and hence 

s 2 m* 4(m*) 3 /l \ /2 + 7V dlogn 



2 



3n4 ^ (--3.1rflogn+l — l( S -rf)logn 12 



It follows that the right-hand side of (126) is at most 

exp ( - Q - 4 ^) rflogn - (^p) ( s - d ) lo S™) ( 127 ) 

Hence, combining (125) and (127), and summing over sets S, integers d E D, edges 
fi, . . . , fd and steps m{fi), . . . ,m(fd), and noting that s — d ^ 7c?, we obtain an upper 
bound on the left-hand side of (123) of 

(3E^-p(-(i-*)««.-('-f)MM) 

^ \ s n n /6 77,1/6 I \ S nV2+7/6 J ^ ^ 
n \ / \ / Jen 



-<5s 

deD x ~ - / \ - / deD 

as required. □ 



7ci Note that we do not lose anything by assuming that fj is open after m(fj) — 1 steps, since this follows 
automatically if N Gm „ (v) C 5 and S is an independent set. 
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7.4. The events B{S, 5) n V(S, 5) c and B'(S, 5) n £>(S, 5) c . We shall next apply our usual 
martingale method in order to show that o[G m [S]) is (with very high probability) well- 
behaved, as long as a 2 < 5s 2 and ^ j a j < n l l 2+2S . (The latter condition will be necessary 
in order to bound the number of choices for the neighbourhoods in S of the vertices of J.) 
For convenience, we remind the reader of the following notation: 

• B(S, 5) = { Ei=i a 2 < 5\S\ 2 } n 0(S, m*) c , where 

0(S,m*) = L(G m [S}) > (1 -e)0 for every m E [m*}\. 



, 2 , 

• B'(S,5) = { Ei=2«' < 5 \ S \ 2 } n 0'(S,m*) c , where 

0'(S,m*) = | (C7 m [5])^(l-e)^ l ^-^e- 4 * 2 foreverymGK]}- 

We shall prove the following two lemmas. 
Lemma 7.9. If 3y/n ^ s ^ n 1 / 2 ^, t/ien 

F^3{S,5)ny{m*)nQ{m*)nV(S,5) c ^ ^ e~ snS 

for every S C l / (G m ) itrai/i | jS' j = s. 

As in the previous subsection, in the maximum degree setting we shall use the event S(m*) 
to control the number of G m -neighbours of the vertex v\ in S, for each m E [m*]. 

Lemma 7.10. If + 7) y/n\ogn ^ s ^ n l / 2+e , then 

F(B'(S, 5) n £(m*) n y(m*) n Q(m*) n V(S, 5) c ) < e" sn * 

for every S C V(G m ) j^l = s. 

We shall use the following notation in the proofs of Lemmas 7.9 and 7.10. Given m E N, 
a set 5 C y(G m ) and a collection Af = {A\, . . . , of subsets of S, set 

k 

N {S,m) = 0{G m [S]) \\JO(G m [Aj]), 

3=1 

and set oj^(S,m) = \Oj^(S,m)\. Set g (t) = n 3S g x (t), define the normalized error to be 

* (q , _ Qjv(S,m) -e- u2 om(S,0) 
° AS ' m) ~ So(t)e--V(S,0) ' 

and write Oj^(S,m') for the event that \oj^(S, m)\ ^ 1 for every m ^ m' . Note in particular 
that g (t) <C 1 for every < t ^ t*, since e 4( **) ^ n 1 / 2-2 , and observe that therefore if 
M (S,m) holds and l^| 2 < £|S| 2 , then 0(5, m) also holds. 
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In a slight abuse of notation, given a collection Af as above, we define Af(S,m) to be 
the event 74 that N Gm+1 (v) D S C Aj for some j G [k], for every vertex v G V(G m ) with 
\ N G m (v) nS\^ n 5 . That is, 

#(S,m)= f| (U{^i( w ) n5 ^^}u{l^o m («)n5|<n'}Y 

feV(G m ) S=i ' 

Observe that if two edges ei,e 2 G 0(G m [S']) are closed by the addition (in step m + 1) of 
edge /, then ei, e 2 and / share a common vertex, and ei,e 2 G Yf(m). Moreover, e\ and e 2 
are both contained in the neighbourhood in G m+ \ of one of the endpoints of /. It follows 
immediately that if Af(S,m) holds, then \ Aoj\f(S,m)\ ^ n s . 

We will deduce Lemmas 7.9 and 7.10 from the following result, which follows by the 
martingale method of Section 3. Since the proof is similar to several of those above (cf. in 
particular the proof of Proposition 3.8 in Section 3.4), we shall omit some of the details. 

Lemma 7.11. Let S C V(G m ), and let Af = (A±, . . . , A k ) be a collection of subsets of S. If 
\S\y/n < o M (S,0) < n 5 / 4 , then 

-\S\n 45 



F(Ox(S,m) c r\Af(S,m) ny(m) C\ Q(m)j < e 
for every m G [m*] . 

We need to deal separately with the case m ^ cu ■ n 3 ^ 2 ; since we again use Bohman's 
method from [10] in this case, we postpone the details to the Appendix. 

Lemma 7.12. Let S C V(G m ), and let Af = (A 1: . . . , A k ) be a collection of subsets of S. If 
\S\y/n < 0^(5,0) < n 5 /\ then 

F^{\o* M (S,m)\ > 1} nAf(S,m) ny(m) H Q(m)) < 

for every m ^ cu ■ n 3 / 2 . 

Proof of Lemma 7.11. We shall apply the martingale method of Section 3. Indeed, we have 
E[Ao M (S,m)] = Yl ( y >M + 1 )' (128) 

Vl ' feO M (S,m) 

from which it follows easily (see Lemma A. 2.1 of the Appendix) that, for each u ■ n 3//2 < 
m < m*, if N (S,m) n y(m) H Q(m) holds then 

E[Ao* M (S,m)] G — (_ ^(5,m)±e). 

Hence, setting K(m) = M (S,m) C)Af(S,m) n ^(m) n Q(m) n {|o^(S,a)| < 1/2} and 
/ = [a, b] — [u • n 3//2 , m*], it follows that ojv(S, m) is (g c , /i; /C)-self-correcting on [a, 6], where 



74 We shall show that this event holds for some collection Af, see Observations 7.13 and 7.14, and then 
apply the union bound, using the event T>(S, 8) c in order to bound the number of choices for Af. 
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h(t) = t ■ rT^I 2 . Moreover, we claim that if /C(m) holds then 

\A 0Af (S,m)\ ^ n 5 and E[\Ao x (S,m)\] ^ ^ • e^o^S, 0). 

Indeed, the first inequality follows from the event J\f(S,m), as noted above, and the second 
from (128), combined with Oj\f(S,m) fl y(m) fl Q(m), since oj\f(S,m) is decreasing. Hence, 
by Lemma 4.23, we obtain 

Since a(i) and /?(£) are C-slow on [a, 6], and min {«(£), /3(t), ^ 7J72 and a(t) ^ £ 2 

for every u < t ^ t* (since \fn ^ o//(S,0) ^ n 5 ^ 4 ), it follows that (C, e; g Q , h; a, /3; IC) is a 
reasonable collection, and that o^(S,m) satisfies the conditions of Lemma 3.5. 
Finally, observe that 



<? o (*) 2 cw(S,0) ^ n 5 ^! 

for every u < t ^t*, since 0^(5", 0) > l-S'lv^ an d ^o(0 = n 3S g x (t). By Lemma 3.5, it follows 
that 

P(£V(S,m') c n K(m' - 1) for some m' e [a, &]) ^ n 4 exp ( - 5'n 55 |,S|) < n"^ 5 !. 

Combining this bound with Lemma 7.12, and summing over choices of w! ^ m, we obtain 
the claimed bound on the probability of the event 0^(S, m) c nJ\f(S, m) fl y(m) fl Q(m). □ 

Lemmas 7.9 and 7.10 follow by applying Lemma 7.11 to all possible collections Af of sets 
{N Gmt (v) fl S : v e J}. We shall need the following two simple observations. 

Observation 7.13. Let 5 C V(G rn ), and suppose that B(S, 5) fl 0(S, m) c holds. Then there 
exists a collection Af = (A 1 , . . . , A k ), with 

suc/i t/iat Oj^(S, m) c fl A^S", m) holds. 

Proof. Recall that the event B(S,5) implies that X^Li a | < S\S\ 2 . Set = NG m +i( v j) ^ ^ 
for each j e [A;], where J(S,5) = {i>i, . . . , i>fc}, and note that therefore XljLi IA?'| 2 < ^l^] 2 - 
As noted above, this implies that Oj^(S,m) C 0(S,m). Finally, NG m+1 (v) H S C Aj for 
every v E J, and |A Gm (t>) fl S'l ^ for every i> J, so Jx(S,m) holds, as claimed. □ 

Observation 7.14. Let 5 C V{G m ) be a set with \S\ ^ (4^ +7) y/nlogn, and suppose that 
B'(S, 5) fl £(m*) fl O'('S') m T holds. Then there exists a collection Af = (Ai, . . . , Ak), with 

|AiK (2 + 5)m ^ ^^f^Sf, 



n 

i=2 
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such that Otf(S, mf H Af(S, m) holds. 

Proof. Recall that the event £{m*) implies that da m (v) ^ (2 + 5)m/n for every to G [m*], 
that B'(S,5) implies that E?=2 a i < 6 \ S \ 2 ' and that 0\ s , m ) c implies that 

2m 2 



o(G m [S\) < (('f') 



n 2 



_4+2 

e 4t . 



Thus, setting Aj = No m+1 (vj) fl S for each j G [A;], where J (5, 5) = {t>i, . . . , f^}, we have 

* ('f')-± Ct 1 ) > H-^^' * (ffl - ■ 

where the final inequality follows since ('g') — ^ 7 • (('f ) + ^p-) • Hence, since 5 = 7) 
was chosen sufficiently small, 

o„(S,m) < (G m [5]) < (l- e ) (('2') -^)e" 4t2 < (l-5)e- 4 * 2 o^,0), 

which implies Om(S, m) c . Finally, as in the previous proof, N Gm+1 (v) f] S C Aj for every 
v G J and \Na m (v) (~) S\ ^ n 5 for every v £ J, so Af(S,m) holds, as required. □ 

Finally, note that if V(S, 5) c holds, then there are at most n 2nl/2+2S different possible 
collections N given by the observations above, since there are at most n v / 2+2S edges between 
S and J. If moreover \S\ > 2y/n, then this is at most n^ n2S . 

We can now deduce Lemmas 7.9 and 7.10. 

Proof of Lemma 7.9. Suppose that B(S,5) H y(m*) H Q(m*) n £>(S,5) C holds, and let m G 
[m*] be minimal such that 0(S, m) c holds. By Observation 7.13, there exists a collection 
J\f = (A u . . .,A k ), with J2 k j= i \ A i\ 2 < $\S\ 2 i such that Oj^(S,m) c nSf(S,m) holds. We shall 
apply Lemma 7.11 for each such collection, and use the union bound. 

Since 3^ < s < n l ' 2+£ and E?=i l^jf < 5 I 5 I 2 > w e have < 0^(5, 0) < n 5 / 4 . 



Hence, by Lemma 7.11, we obtain 



w(Otf(S, m) c n A/"(S, to) n y(m) n Q(m)) < m* • n |s|nM • e" |5|n 



5 ^ e~ m \ 



where the sum is over to G [to*] and families Af as described above, which satisfy V{S, 8) 1 
It follows immediately that 



-sn 6 



F\B(S, 5) n y(m*) n Q(m*) n £>(S, 5) C J < e 

as required. □ 

The proof of Lemma 7.10 is similar, using Observation 7.14. 

Proof of Lemma 7.10. Suppose that B'(S, 5) n S(m*) n y(m*) n Q(to*) n 5) c holds, and 
let m G [to*] be minimal such that 0'(S,m) c holds. By Observation 7.14, there exists a 
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collection J\f = {A u .. .,A k ), with ^ (2 + 5)m/n and £)J =2 \ A i? < S \S\ 2 , such that 
Otf(S, m) c nJ\f(S,m) holds. We shall again apply Lemma 7.11, and use the union bound. 
Since + 7) y/n log n < s < n 1,/2+£ and 

as in the proof of Observation 7.14, it follows that I^IV^ ^ °Af(S,0) ^ n 5 / 4 . Hence, by 
Lemma 7.11, we obtain 

^F(Otf(S,m) c nAf(S,rn) ny(m) H Q(m)) ^ m* • n |5| " M • e" |5|n4 ' < e^, 

where the sum is over m G [to*] and families Af as described above, which satisfy T>(S, 5) c . 
It follows immediately that 

f(b'(S, 5) n £(to*) n y(m*) n Q(m*) n £>(S, 5) c ) ^ e~ sn *, 

as required. □ 

7.5. The events C(S,6) and C'(S,5). Recall that C(S,<5) denotes the event that sl(S,5) = 
(ai, . . . , a|j|) satisfies X]j a j ^ <5 1 tS' | 2 and J2j a j < n 1 ^ 25 , and that S is an independent set 
in G m * . The main result of this subsection is the following lemma. 

Lemma 7.15. If s ^ (\/2 + 7) y/n \ogn, then 



w(c(S, 5) n £(m*) n y(m*) n Z(to*) n Q(to*)) 



S: |S|=s 



We shall also prove the following easy lemma which shows that if £(m*) fl Z(m*) holds, 
then the event C'(S,S) fl W(S,v) does not hold for any pair (S,v) with \S\ > y/n. 



Lemma 7.16. For every S C V(G m ) with \S\ ^ y/n, and every v G V{G m ), 

C'{S,5)nW{S,v) C (£(to*) nZ(m*)) c . 

Recall that ai ^ . . . ^ a|j|, and let k = k(S, 5) be minimal such that Ylj>k a | < ^l^] 2 . We 
begin with an easy but key observation which will be used in both proofs. 

Observation 7.17. Let S C V{G m ) with \S\ > y/n. Then 

C(S,5)UC'(S,S) C {a fe >n 1/2 - 35 } n j^n 55 }. 

Proof. Noting that the event C(S, 5) U C^S", 5) implies that a,- < n 1 / 2+2<5 , we obtain 

n 1 - 5 < 5\s\ 2 ^ ^ a *X^ < a ^ 1/2+2S , 

j^k j^k 

by the definition of k. The claimed bounds on ak and /c now follow immediately. □ 
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Proof of Lemma 7.16. Fix S C V(G m ) with \S\ ^ y/n and v G V(G m ), and suppose that 
the event C'(S,5) fl W(S,v) D £(m*) fl Z(m*) holds; we shall show that this is impossible. 
Recall first that the event W(5', i>) implies that {u, v} G 0(G m *) U E(G m *) for every u £ S. 
We shall show, using the event £(m*) fl Z(m*), that every vertex u> G V{G m ) other than t> 
has at most 

n 4e + (logn) 2 (129) 

G m * -neighbours in S. Indeed, fix w G V(G rn ), and note that (as in the proof of Proposi- 
tion 3.7), since the event £(m*) holds, there are at most 

(l + o(l)) •4t*e- 4 < t *) a v ^ < n 4£ 

vertices u G S such that {w, t>} G 0(G m ») and G E{G m *). Similarly, since the event 

Z(m*) holds, there are at most (logn) 2 vertices u G S such that {u,v}, {u, w} G E(G m *). 

It follows from (129) that a 2 ^ n 4e + (logn) 2 . But J2\=2 a ) > 5s2 > since S ) holds ' and 
so k(S,5) ^ 2. Hence, by Observation 7.17, we have a 2 ^ n 1 / 2-35 , which is a contradiction, 
as required. □ 

The proof of Lemma 7.15 is considerably harder, and so we shall give a brief sketch before 
plunging into the details. We are again motivated by Observation 7.17, but since we no 
longer assume that the event W(S, v) holds (and so very many vertices can have very high 
degree into S) we shall need some extra ideas. We will ignore the 'high degree vertices' in J 
with between n s and n 1 / 2-35 neighbours in S, and focus on the set J' C J of 'very high degree 
vertices', which form an unusually dense bipartite graph H = G m *[J f , S]. Moreover, there is 
a trade-off in choosing the graph H: the more edges it has, and the earlier the edges of H 
are chosen, the less likely it is to occur, but the easier it is to keep the set S independent. 
We shall need to keep track of each of these competing influences. 

In order to show that such a structure (a bipartite graph H as described above, sitting 
on an independent set S) is unlikely to exist in G m *, we partition the space according to 
the sets S and J', the graph H, and the collection m = (m(f) : / G E(H)^j of steps of the 
triangle-free process at which the edges of H were chosen. The probability that the edge / 
is chosen in step m(f) is l/Q(m(f)) 2e 4 * /n 2 , since the event Q(m*) holds; the hard part 
will be to bound the number of 'forbidden' open edges at each step. 

The forbidden open edges come in two types: those inside S, and those which would close 
not-yet-chosen edges of H, i.e., in the set 

Y H (m) := (J Y f (m), where T{m) := {/ G E(H) : m(f)>m}. (130) 

/€T(m) 

In order to keep these sets disjoint, we shall ignore open edges in S which are closed by 
vertices in J', i.e., we shall consider the sets Oj^(S, m), where M = (Ai, . . . , Ak) encodes the 
neighbourhoods Aj = Na m * (vj) fl S of the vertices of J', cf. Section 7.4. Observe that if 
/ G Otf(S, m) l~l Y h (m) for some h G E(H) with m(h) > to, then / C N Gm „ (v) D S, where v 
is the endpoint of / in J', which contradicts the assumption that / G Oj^(S,m). It follows 
that the two sets of open edges we consider are indeed disjoint, as claimed. 
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Finally, note that we can bound the probability that oj^(S,m) is smaller than expected 
using Lemma 7.11; thus, all that remains is to bound from below the size of the set Yff(m). 
Our key tool in doing so will be the following lemma. Given a graph G and an oriented 
edge / G V(G) 2 , let d^f) and d^(f) denote the degrees of the (left and right, respectively) 
endpoints of / in G, and let 



E(G) := |J fN G (u)nN G (v)\ 

w-KT') 2 



denote the set of edges which are 'double-covered' by G, that is, those which are contained in 
the neighbourhood of at least two vertices of G. Recall from Section 5 that Y^{m) C Y e (m) 
denotes the collection of F-neighbours / of e such that the vertex v £ e \ f has label L. 

Lemma 7.18. Let m G [m*\, let H be a graph with E(H) C E(G m *) fl 0(G m ), and give an 
orientation to each edge of E(H). If Z{m*) holds then 

|J Y f L (m) 2 Y f L (m) - (\ogny(\Z(G m *[V(H)})\+ £ df(/)Y 

j<=E(H) feE(H) ^ feE(H) ' 

We remark that we shall in fact apply Lemma 7.18 to two different subgraphs, T(m) and 
T'(m) C T(m) (defined below), of the graph H = G m , [J', S}. 75 

Proof of Lemma 7.18. The lemma follows by inclusion-exclusion, together with the observa- 
tion that 

\Y f L (m)nY h L (m)\Z{G m *[V(H)})\ < d*(f)(logn) 2 . (131) 

heE(H)\{f} 

To see (131), let f,h G E(H) and suppose first that / and h are disjoint. Then (clearly) 
|y^(m) fl Y^(m)\ G {0, 1}, and moreover we claim that 

Y f L (m) nY h L (m) C ~(G m *[V(H)}) . 

Indeed, to see this simply note that /, ft G E(H) C E(G m *), so if e G Y^(m) fl Y^{m) then 
it is double-covered by G f m *[V(if)]. Note also that if / 7^ ft, but / and ft, intersect in the 
'left' endpoint of either / or ft, then Y^(m) fl Y^(m) = 0. 

Now, since Z(m) holds, we have \Y^(m) fl y^(m)| ^ (logn) 2 for every pair of edges 
/, ft G 0(G m ), by Observation 5.15. Moreover, there are at most d^(f) edges h G E(H)\{f} 
which contain the 'right' endpoint of /, and so (131) follows. 

To deduce the lemma from (131), simply note that (for any set A C E(K n )) 



U Yf(m)\A 
feE(H) 



> E \Y, L (m)\A\- \Y f L (m)nY h L (m)\A\, 

feE(H) f,heE(H) 



set A = S(G f m .[V'(if)]), and observe that, since Z(m*) holds and if C G rn *, each edge 76 of 
E(G m *[V(H)]) appears in Y f L (m) for at most (logn) 2 edges / G E(H). □ 



75 We shall also use it in Section 7.6, below. 

76 In fact, this holds for every edge, not just those in S(G m * [V(if )]). 
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We are now ready to prove Lemma 7.15. 

Proof of Lemma 7.15. Let (v^2 + r y)\/n logn ^ s ^ n l l 2+£ ; we begin the proof by breaking 
up the event whose probability we wish to bound into more manageable pieces. Indeed, 
for each S C V(G m ) with \S\ = s, each k ^ n 55 , each J' C V(G m ) \ S with \J'\ = k, 
each bipartite graph H on S U J' with dff(i>) ^ n 1 / 2-35 for each v G J', and each collection 
m = (m(f) : / G E(H)) G [m*] e ^, let C(#, m) denote the event that the following all 
hold: 

(a) 1(S, to*) n £(to*) n ^(to*) n Z(to*) n Q(m*). 

(b) {G m *[J>, S] = H}n{ Z A j, a) < 5\S\*} n { < n 1 ^}. 

(c) For each / G E(H), the edge / was added in step m(f) of the triangle-free process. 

Note that we suppress the dependence of C(H,m) on S and J', by encoding both sets in 
the graph H. By Observation 7.17, we have 

P(c{S,5)n£(m*)ny{m*)nZ{m*)nQ(m*)^ ^ ^>(C(#,m)), (132) 

5:|5|=s H,m 

where the second sum is over all graphs H as described above, and sequences m G [m*\ e ^ H \ 
As noted in the sketch above, our bound on the probability of C(H,m) will have two 
parts: a bound on the probability that the edges of H are chosen at the steps corresponding 
to m, and a bound on the number of 'forbidden' open edges at each step. Recall that an 
open edge is forbidden if it is in the set S, or if it is a V-neighbour of a still-open edge of H . 
We begin by controlling the number of open edges inside S. 

Let J' = {v i, . . . , Vk] and Af = (A±, . . . , A k ), where Aj = N H {vj) fl S for each j G [k]. 

Claim 1: With probability at least 1 — e~ sn ' iS , either C(H, m) c holds, or 

oAS, m) > ((1 - 25) Q - £ ) e-- 2 (133) 

for every to G [m*]. 

Proof of claim. The claim follows by applying Lemma 7.11 to a large family of collections 
Af' D Af, cf. Section 7.4. Indeed, we claim that if C(H, m) holds, but (133) fails to hold, 
then there exists to G [to*] and Af' = (Ai, . . . ,Ai), with Y?j=k+i IA>'| 2 < $\S\ 2 > suc h that 
Om>(S, m) c n A/ 7 ^, to) holds. To see this, set A,- = 7V" Gm „ n 5 for each k < j ^ £, where 
7(5, 5) = {f i, . . . , ve}, and observe that the event C(H, m) implies that 

£ w = £ «? < ^i 2 - 

j'=fe+i vjeJ\J' 
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It follows that 



o„,(S,m) < oAS,m) < ((i _ 2 *)g) - ^ g))-^ 



e 



< (1-5) 



E(2')) e " 4t2 < (l-^o^^O), 



assuming that (133) fails to hold for to, and so we have On>(S, m) c , as claimed. Moreover, 
note that Na m+1 (vj) H S C Aj for every j G [£] and |iVG m (t>) H S\ ^ n s for every v £ J, and 
so the event Af'(S,m) also holds, as required. 

We shall now apply Lemma 7.11 to each possible such collection Af', and use the union 
bound. Observe that the claim is trivial if the bound in (133) is negative, and so we may 
assume that 

and hence, since y/n « s ^ n 1 / 2 ^, that |<S , |v / "' ^ °Af(S,0) < n 5//4 . Recall also that, since 
j cij < n l l 2+2S and \S\ ^ 2>/n, there are at most n' 5 '™ possible collections Af' as described 
above. By Lemma 7.11, and summing over to G [to*] and collections Af' as described above, 
it follows that 



v(Ow(S,m) c nAf'(S,m)ny(m) n Q(m)) 



N',m 

as required. □ 

Next we turn to the set Yh(to), i.e., to the forbidden open edges which are y-neighbours 
of some not-yet-chosen edge of H. Recall from (130) the definition of 

T(to) = {/ G E(H) : m(f) > to}, 

and set a(/) = a,- for each edge / G E(H), where i>j is the endpoint of / in J'. The following 
claim is a straightforward consequence of Lemma 7.18. 

Claim 2: Suppose that C(H, m) holds. Then, for every cu ■ n 3//2 < m ^ to*, 

U Y)(m) > ^_5 2 W (m) | T(m) | + £ maxj Q-^F(m)-a(/)(logn) 2 , o|. 



/€T(m) 



/er(m) 



Proof of claim. Let us give an orientation to each edge / G T(m), by saying that its right 
foot is in S. The claim follows by applying Lemma 7.18 twice, first to T(m), and then to a 
suitably-chosen subset T' C T, with reversed orientations. 

Let us begin by observing that since I(S,m*) D Z(m*) holds, we have 

\E(G m ,[V(H)\)\ « (' j ') + Oog-J'P 1 ) < »°"> « »' < gjgi <134) 
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Indeed, since S is an independent set in G m * , every edge which is double-covered by edges of 
H is either inside J', or is in the common G m *-neighbourhood of two vertices of J'. The first 
inequality now follows from the event Z(m*), since each pair of vertices of J' has at most 
(logn) 4 edges in their common neighbourhood. The other inequalities follow since \ J'\ ^ n 55 , 
since 5 = 5(e) <C e, and by the definitions of Y and m*. 

We next claim that, since X(S,m*) D £(m*) n Z(m*) holds, we have 



U y i l m 




/6T(m) 





2 



Y(m)\T(m) 



(135) 



To prove (135), apply Lemma 7.18 to T(m), and use (134) to bound |S(G f m *[V'(if)]) |. Since 
we have Y f L (m) G (§ + o(l))F(m), by £(m*), and < |J'| < n 5<5 < f (m)/(logn) 2 for 

every / G E(H), we obtain (135), as claimed. 

Finally, recall that a(f) = djj(f) for each edge / G E(H), and define 



T'(m) = < / E T(m) : - - 5 2 ) Y(m) > a(f)(\ogn) 



We claim that 



U 

/€T'(m) 



m) 



^ E^ max 

/er(m) 



5 2 y(m) - a(/)(logn) 2 , 



(136) 



This holds trivially if \T'(m)\ = 0, so assume not and apply Lemma 7.18 to T'(m). Using 
the event £(m*) and (134), we obtain 



U y, 

/ST'(m) 



Ri 



m 



5 2 Y(m)\T'(m) 



E a (/)( lQ g 

/£T'(m) 



which immediately implies (136). Combining (135) and (136), and noting that Y^(m) D 
Y^[m) = for every f,h & E(H), since 5 is independent, the claim follows. □ 

We are ready to prove our desired bound on the probability of the event C (H, m) . 



Claim 3: For every bipartite graph H on S U J' , with j^l 



s. 



\J'\ 



k < n 5S and 



d H (v) ^ n 1 / 2 35 for each t> G J', and each collection m = (m(f) : / G E(H)} G [m*] e ( H \ 



P(C(tf, m)) < exp ( — E aj - U + I 2 js log n J] 

j=i V / / j=1 



a 3 



4-4<5 



+ e 



n 3S 



Proof of claim. If C(H,m) occurs, then at each non-m step of the triangle-free process we 
do not choose a forbidden open edge, and at step m(f) we choose edge /, for each / G E(H). 
By Claim 1, the probability that C(H, m) occurs and (133) fails to hold is at most e~ sn , 
so let us assume from now on that 

k 



o x {S,m) > (1-25) 



E 



-At 2 
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for every m G [m*]. Moreover, by Claim 2, we have 

\Y H (m)\ > (\~ A Y(m)\T(m)\ + £ max j (± - Y(m) - a(f)(\ogn) 2 , j 

V ' /6T(m) IV/ ) 

for every to ■ n 3//2 < m ^ to*, where Yh{tti) was defined in (130). Moreover, as noted earlier, 
Ojv(S,m) and Y/j(to) are disjoint sets of forbidden open edges, since if / e Oj^{S,m) n 
Y#(to), then / is contained in the G m *-neighbourhood (in 5) of some vertex of J' (that is, 
in one of the sets A,- e A/"), which contradicts the definition of Ox(S,m). 

It follows that the number of forbidden open edges at step to + 1 > oo ■ n 3 / 2 is 

o M (S,m) + \Y H (m)\ > e" 4 ' 2 max |(1 - 28) Q - £ Qf) , o} 

+ Q - 5 2 ) F(to)|T(to)| + £ max { Q - 5 2 ) F(m) - a(/)(logn) 2 , o}, (137) 

V / /GT(m) IV/ J 

and hence the probability of choosing a forbidden open edge 77 is at least the right-hand side 
of (137) divided by Q(m). Let us take the three terms on the right-hand side of (137) one 
at a time. Indeed, since Q(m*) holds, and setting m = \u ■ n 3 / 2 ] , we have 



m=mo v N ' J=l x ' y J=l 

and, recalling that £(/) = to(/) • n~ 3 / 2 , 

^ F(m)|T(m)| ^ ^ (8 - S)m \^ 1 (8 - £)m ^ + ( f \2 

2- ofm) — ^ 2- 2- n 3 =2^2^ n 3 > (4 - 5) ^ f (/) . 

m=l MV ; m=l/eT(m) fGE(H) m=l feE(H) 

In order to bound the final term in (137), let us write i(f) for the time u < t < t* at which 
F(m) = C • a(/)(logn) 2 , if such a time exists, and set i(f) = or i(f) = t* otherwise, 78 in 
the obvious way, and set to(/) = i(f) ■ n 3 / 2 . We obtain 



EgT^y E max{Q-^F(m) - a(/)(logn) 2 , o} 



/6T(m) 



min{m(/),m(/)} , . 

>E E U- 2 * 2 ) 5 > < 2 " f ) E ™« {«/).<(/)}'• 

fGE(H) m=l ^ ' feE(H) 



77 Note that, if this probability is p, then the probability of not choosing such an edge is at most e p , and 
the choices at each step are independent. 

78 0f course, if Y(u ■ n 3 / 2 ) < C ■ a(/)(logn) 2 then set t(f) = 0, and if Y(m*) > C ■ a(/)(logn) 2 then set 
i(f) = t*. Note for future reference that we have e" 4 *^ 2 < a(f) ■ e^**) 2 , since e 4 (**) 2 < n 1 / 2 ^. 
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Combining the last several inequalities, it follows that the probability that we avoid choosing 
a forbidden open edge at every (non-m) step of the triangle-free process is at most 79 



cxp 



m 



E «?-(!- ^(^+2 E (t(f) 2 +™*W),t(f)} 2 )) 

i=i v feE(H) J 



(138) 



Finally, note that, since Q(m*) holds, the probability that we choose the edge / at step 
m(f) for each edge / G E(H) is 



TT 1 



(139) 



feE(H) 



It follows that the probability of the event C(H,m) is at most the product of (138) and 
the right-hand side of (139). The remainder of the proof is a straightforward calculation. 
Indeed, note first that 

V (H,m) := (1 + 35) t(f) 2 -(l-35) min {£(/), t(/)} 2 (140) 

fdE(H) feE{H) 

is increasing in t(f) for every / G E(H). Moreover, recalling that i(f) depends only on 
a(f) = dji(v(f)), where v(f) is the endpoint of / in J', we may define t(vj) = £(/) for any 
edge / G E(H) which is incident to Vj, for each j G [k]. It follows that 

feE(H) j=l 

since each vertex v 3 - is counted exactly du{vj) = a 3 times in the sum on the left. Since 
f (/) < t* and i(f) < t* for every / G E(H), and e(H) = Y? j=1 aj, it follows that 



K 

j'=i 



Recalling that e 4 *(^') 2 ^ a j • e 4 (**) 2 ; and noting that e 24<5 (**) 2 < n 3<5 , it follows that 



(141) 



Now, multiplying (138) by the right-hand side of (139), and using (141), we obtain 



P(C(#,m)) < 



n(« 3i ^ i/2 



exp 



E«?-(i-<0 



m*s 2 



79 T 



Here we use the fact that X^mli ^'qfro)"^ ^ 0(uj 2 ) ■ e(H), which is swallowed by the error term. 
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Finally, observe that 



exp ^ - (1 - e) 1 ^- S j < exp ^ - Q + 7 2 ^ slognj, 



since s ^ (\/2 + 7) y/n logn. It follows that 



P(C(tf,m)) *C J] 

j'=i 



a 



4-45 



n 



a.j/2 



exp 



n 



2 



+ 7 2 ] s log n 



+ e 



as claimed. 



(142) 



□ 



We are finally ready to sum the probability of C(H, m) over H and m. To simplify the 
counting, let us first fix k and a. Note that we have (") choices for S, at most n k choices 
for J' (given k), at most rij=i (a ) cn °i ces f° r H (given S, J' and a), and at most (m*) 6 ^ 
choices for m. Note that, since e(H) ^ n 1 / 2+2<5 , we may disregard the final term in (142). 

Since e(H) = J2j=i a v we have 



y(H) 

i=i 



dj I \n 



4-4<5 



dj/2 



exp 



m 
n 2 



E«0 < n 



j'=i 



es m Jdj 

2~=2~5~ ' eX P 



and hence, since sm* ^ n 2 log n, it follows that 



80 



£>(C(ff,m)) < -exp - - + 7 2 alogn J] 



A r n 3S 



j'=i 



exp 



m Oj 
n 2 



m Oj 



But since £(m*) holds, we have Oj ^ (4= + 5) ^nlogn for each j G [A;], and thus 



n 



38 



■ exp 



ma 



>V2 

€■ — exp 



n 



1/4 



e 2 logn < 1. 



Hence, summing over k ^ n 5 and sequences a = (ai, . . . , a&), we obtain 

^P(C(tf,m)) < (") j>* • n fc • exp ( - Q +7 2 ^ s logn) 



- » 

fc=i 



2A- 



en 



n 



fe=i 



since s ^> y/n, as required. By (132), the lemma follows. 



□ 



Here the sum is over those H and m with the given values of k and a. 
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7.6. The event V(S,5). Recall that V(S,S) denotes the event that S is an independent 
set in G rn * and X]j a j ^ n 1 / 2+2<5 . In this subsection we shall bound the probability that the 
event T>(S, S) occurs for some S C V(G m ) with IS*! ^ ^Jn\ogn. Unlike in the previous three 
subsections, it will not suffice to use the union bound over sets S; nevertheless, we shall 
prove the following bound. 

Lemma 7.19. If s ^ n 1 / 2+5 ^ 7 then 

P ( U ^(S,5)n£(m*)nZ(m*)nQ(m*yj ^ n~^. (143) 

The idea of the proof is as follows: if X]j a i ^ n 1 / 2+2<5 , then the bipartite graph G m *[S, J] 
contains a subgraph H = G m *[S*,J*] whose appearance in G m * is highly unlikely. This 
subgraph will have the following two properties: 

(a) e(H) ^ n 1 ^ 2+5 and (b) H is 'close to regular on both sides', 

i.e., (1h{u) / du{v) = 6(1) for every u,v G 5", and also for every u,v G J'. (This regularity 
property will be useful to us twice: in bounding the number of forbidden open edges at each 
step, and in the final calculation.) The calculation required to bound the probability that 
such a graph H occurs in G m * is similar to (but somewhat simpler than) that in the previous 
section; the main difference is that we cannot control the number of open edges inside S, 
and so the only forbidden open edges are those in Y H {m). 

We begin with a straightforward (and probably well-known) lemma. 

Lemma 7.20. Let G be a bipartite graph on vertex set AU B with e{G) ^ v{G), and let 
v G (0,1). There exist sets A* C A and B* C B such that the induced bipartite subgraph 
H = G[A*, B*] has the following properties: 

e(H) > , A H (A*) <: 2 7 /»5 H (A*) and A H (B*) <: 2 7 /»5 H (B*), (144) 

where A H (S) = max{ci^(-u) : u G S} and Sh(S) = min{<i^(-u) : u G S}. 

In the proof of Lemma 7.20 we will use the following simple (and standard) observation. 

Observation 7.21. Let G be a bipartite graph on vertex set A\JB. There exist sets A* C A 
and B* C B such that the induced bipartite subgraph H = G[A*,B*] has the following 
properties: 

e(i/)>^, M^)>^ and S S (B1 > *f , 
where dc(A) (resp. da(B)) denotes the average degree of a vertex of A (resp. B) in G . 

Proof. Simply remove, one by one, vertices in A with degree at most dc{A)/A in the remaining 
graph, and vertices in B with degree at most dG{B)/4. Since we clearly remove at most 
e(G)/2 edges in total, this process ends when we reach the desired subgraph H. □ 

We can now prove Lemma 7.20. 



\l-v 
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Proof of Lemma 7.20. Note first that, by Observation 7.21, it will suffice to find an induced 
subgraph H = G[A*, B*\ with 

e(H) ^ -A-l , A H (A*) < 2 5/u d H (A*) and A H (B*) < 2 5/u d H (B*). (145) 

We shall therefore aim to satisfy the properties (145) instead of (144). 

Let A* = A' \ A" and B* = B' \ B", where A', 5', A" and B" are defined as follows. Set 
£ — 2 /is, and let A' C A and B' C 5 be a pair of subsets which maximizes 

u{A , BI) = 

Now let A" C A' and £" C B' denote the high degree vertices of H' = G[A',B'], i.e., 

A' = {u e A' : d H ,(u) ^ 4^ • and S" = {u e B' : d H ,(u) > 4 £ ■ d H >(B')}. 

In order to see that the subgraph H = G[A*,B*} satisfies the properties (145), observe 
first that, by the maximality of the pair (A, B'), we have 

w(A',B') ^ w(A,B). 

We claim that therefore e(H') > e{G) 1 ~ l> . Indeed, we have 

e(H') = e(G[A',B']) > (Jj^jfj 1 ) ^ e(G) > ^ > e{Gf 

since v(G) < e(G) and £ = 2jv. Moreover, note that we have A H (A*) < 4 £ • d?j/(A') and 
A H (B*) ^ 4 e ■ du'(B'), by construction. 

We claim that e(H) ^ e(H')/2, which, together with the bounds proved above, implies 
the inequalities (145). Indeed, by counting edges we have 

4 e 'd H >(A')-\A"\ ^ e(A", B') < e(A',B') = d H >(A') ■ \A'\, 
and so \A"\ < 4~ e \A'\ and similarly \B"\ < 4~ £ | J B'|. It follows that 

by the maximality of {A, B'). Using Jensen's inequality, we obtain 

e(A",B')+e(A',B") <: 2^ • (e(A", B') e + e(A', B") E ) ^ <: B ^ B '\ 

and hence e(H) ^ e(H')/2, as required. □ 

We are now ready to prove Lemma 7.19. 

Proof of Lemma 7.19. Let S C V(Cr m ) be a set of size s ^ n 1 / 2 ^ 3 , and recall that we denote 
by J = J(S,5) = . . . , Vk} is the set of vertices with at least n 5 G m * -neighbours in S, 
and set aj = \NG^(vj) fl S\ for each j e [A;]. Suppose that the event X]j a j ^ n l l 2+2& occurs 
for S; we make the following deterministic claim. 
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Claim 1: There exist sets S* C S and J* C J such that the induced bipartite subgraph 
H = G m *[S* : J*} has the following properties: 

e(H) > n l/2+ \ n s/2 ^ A H (S*) ^ C ■ 5 H (S*) and n s/2 <: A H (J*) ^ C ■ 5 H (J*). (146) 

Proof of claim. Simply apply Lemma 7.20 to the graph G m *[S, J], and recall that we chose 
C = C(e, 5) to be sufficiently large as a function of 5. □ 

Now, using Claim 1, let us break up the bad event in (143) into more manageable pieces, 
as follows. For each S*, J* C V(G m ), each bipartite graph H on S* U J* satisfying (146), 
and each collection m = (m(f) : / G E(H)) G [m*] e< -- H \ let D(H,m) denote the event that 
the following all hold: 

(a) 1(S, rrf) n £{rrf) n Z(m*) n Q(m*). 

(b) {G m .[S*,J*] = H}. 

(c) For each / G E(H), the edge / was added in step m(f) of the triangle-free process. 

As in the previous subsection we suppress the dependence of D(H,m) on S* and J*, by 
encoding both sets in the graph H. By Claim 1, we have 

P( [J V(S,5)n£(m*)nZ(m*)nQ(m*U ^ ^F(D(H,m)), (147) 

^S:|5|=s ' H,m 

where the sum is over all graphs H as described above, and sequences m G [m*] e( - H \ 

Our bound on the probability of D(H,m) is similar to that of C(H, m) in the previous 
subsection. It again consists of two parts: a bound on the probability that the edges of H 
are chosen at the steps corresponding to m, and a bound on the number of 'forbidden' open 
edges at each step. This time, however, an open edge is forbidden only if it is in Y H (m), i.e., 
if it is a V-neighbour of a still-open edge of H . 

In order to bound the number of forbidden edges, we shall need the following claim, which 
follows by modifying 81 the proof of Lemma 7.18. We again define 

T(m) = {/ G E(H) : m(f) > m}, 

and let d H (f) = d L H (f) + for each edge / G E(H). 

Claim 2: Suppose that D(H, m) holds. Then, for every u ■ n 3//2 < m < m*, 

|J Y f (m) > max{(l-5 3 )f(m)-^(/)(logn) 3 , 0}. 



/er(m) 



/6T(m) 



Proof of claim. We consider, for each open edge / G T(m), the following subset of the Y- 
neighbours of /: 

Yf(m) := j/i G l/(m) : {-u,w} ^ E(H), where u = f\h and v = h \ /j. 



81 Observe that, since we have no upper bound on | J*|, we cannot control the size of the set s(G m « [V (i? )]) , 
and so it is not sufficient to simply apply Lemma 7.18. 
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In words, we consider only those F-neighbours of / in G m such that the step in the F-graph 
(from / to h, say) does not use an edge of H . The motivation for this definition is that 
Yp{m) ^ Yf{m) — dn(f), and if / and h are disjoint edges of T(m), then 

Yf (m) n Y^{m) = 0. (148) 

To see (148), let e G Y^(m) fl Y^{m), and note that either e has an endpoint in each of S* 
and J*, which is impossible because S is independent, or e is contained in either S* or J*, 
which is impossible because H = G m *[S*, J*] is the bipartite graph induced by S* U J*, and 
so the edge linking e and / in the F-graph must be an edge of H . 

It follows from (148), together with the proof of (131) in the previous subsection, that 

£ \Y f H (m)nY h H (m)\ ^ d H (f)(\ogn) 2 . (149) 

heT(m)\{f} 

Indeed, since Z(m) holds we have \Y^{m) fl Y^{m)\ ^ (logn) 2 for every pair of edges 
f,h e 0(G m ), by Observation 5.15, and there are at most d H (f) edges h £ T(m) \ {/} 
which intersect /, so (149) follows. 

The claim now follows by inclusion-exclusion. Indeed, let us define 

T\m) = |/GT(m) : (l - 5 3 )Y(m) > ^(/)(logn) 3 |, 

and observe that, by (149), and since the event £{m*) holds, 



U y /K 

/6T(m) 



U *7< 

/ST'(m) 



> E l*/V)| 

/eT'(m) 



£ |if(m)ny^(m)| 



f,heT(m') 
f¥=h 



> E ((i-5 3 )%)-4(/))- E ^(/)( lo g 

feT'fm) /eT'(m) 

> ^ max{(l-5 3 )f(m)-^(/)(logn) 3 , o} 

/6T(m) 



for every w • r?l 2 < m ^ m*, as required. 

We are ready to prove our desired bound on the probability of the event D(H, m). 
Claim 3: For every bipartite graph H satisfying (146), and each collection m e [m*] e( - H \ 



□ 



F(D(H,m)) ^ 



2+<5 2 



n d n(/). 



(150) 



Proof of claim. If -D(i/, m) occurs, then at each non-m step of the triangle- free process we 
do not choose a forbidden open edge, and at step m(f) we choose edge /, for each / e E(H). 
By Claim 2, there are at least 



max{(l-5 3 )r(m)-^(/)(logn) 3 , 0} 



feT(m) 
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forbidden open edges in G rn , for every cu ■ r?l 2 < m ^ m*. 

Similarly (but not identically) as before, let us write t(f) for the time t > oj at which 
Y(m) = C ■ c?^(/)(logn) 3 , if such a time exists, and set i(f) = otherwise. Note that the 
degree dnif) of an edge is the same up to a factor of C for all edges of H, by (146). Since 
the event Q{m*) holds, we have 



TO* 

Eq^) E max{(l-^)y(m)-rf H (/)(lo gn ) 3 ,0} 

E E ( 8 -°(*'))^ > ( 4 -0(*')) E min {*(/), t(/)} 2 , 



/GT(m) 

min{m(/),m(/)} 

feE(H) m=l feE(H) 

where rh(f) = t(f) -n 3//2 , and hence the probability that we avoid choosing a forbidden open 
edge at every (non-m) step of the triangle-free process is at most 

exp(-(4-5 2 ) ( min {*(/)• t(f)} 2 -" 3 )), (151) 

since J2m=i Z)/eT(m) §[^y ^ 60,3 ' e (-^0- Next, note that since Q(m*) holds, the probability 
that we choose the edge / at step m(f) for each edge / G E(H) is 

It follows that the probability of the event D(H,m) is at most the product of (151) and the 
right-hand side of (152). Note that this product is increasing in t(f) for each / G E{H), so 
we may set t(f) = t*. Since e 4 ^*) 2 < n 1 ^ 2 ^ 26 , we obtain 82 

P(D(H,m)) < (^Y^expf-^-* 2 ) E min {£(/), r} 2 ). (153) 

Suppose first that t(f) ^ t* for every / G E(H). Then (150) follows easily from (153), since 
we have e -4 *^ ^ dn(f) • n _1 / 2+<5 for every / G E(H), and hence 

/ 1 V^) / ^l-S / 1 \ e W 

p(B(//,m))< n (^(/)'«- i/2+i ) « bL n <w>. 

as claimed. On the other hand, if i(f) > t* for some / G E(H), then £(/i) 2 > (t*) 2 - C for 
every h G by (146). It follows from (151) and (152) that 

e(H) , \ / i \ e W 



P(U(//,m)) < (^) exp(V Y. '</) 2 ) < (^) II <«/). 



82 Notc that e 4 " 3 <C n 15 , which is easily swallowed by the error term. 
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since t(f) ^ t* and d H (f) ^ 5(H) ^ n 5/2 for every / G E(H), as required. □ 

We are finally ready to sum the probability of D(H,m) over H and m. Fixing \S*\, \J*\ 
and e(H) (we will sum over these at the very end), observe that we have at most 



n \ n 



S* 1 1 J* \ \ ( 3 1 S* 1 1 J* I \ 



\S*\J \\J*\J V e(H) J V e(if) 
choices for iJ, since e(H) > (IS**) + | J*|) logn. Note also that, by the AM-GM inequality, 

n Mr, < e «/>f < (^pf . 

feE(H) v v ; feE{H) 7 

where the second inequality follows since H is almost regular. Hence, by Claim 3, 

£P(fl(*,m))< E (^•"••^'^) e< ' ,, «E«- iM " , <«^ 

#,m \S*\,\J*\ V V ' 11/ e(H) 

e(H) 

since |S'*| ^ s ^ n 1 / 2+<53 and e(i/) > n 1 / 2 ^, as required. By (147), the lemma follows. □ 

7.7. The proof of Propositions 7.1 and 7.2. The two main propositions of this section 
follow easily from the lemmas above. Recall from (120) and Definitions 7.4 and 7.5 the 
definitions of the events W(S, v), A'(S, 5), B'(S, 5), C'(S, 5) and V(S, 5). 

Proof of Proposition 7.1. Set s = (^4= + 7) V n logra, and observe that if A(G n; A) ^ s, then 
the event 

T(S,v) = {N(v) = S inG„, A }, 
holds for some v G V r (G7 m ), and some S C \^(G m ) with j^l > s. Now, by (122), we have 

[J T(S,v) nS(m*) C (J (U'(S,6) UB'(S,<5) UC'(5,i) UP(S',<J)) n W(S», 

|S|3ss |5|=s 

and thus, setting F(S, 5) = A'(S, 5) U B'(S, $)UC'(S, 6)UT>(S, 5), it will suffice to show that 
p ( U ^(^) nw(5,») n%*) n^K) fl2(m*) fl Q(m*)) ^ e~^\ (154) 

To prove (154), recall that by Lemma 7.8, we have 

w(a'(s,6) n W(S,v) nS{m*) n Q(m*)) < n - * 8 , 

5:|5|=s 

by Lemma 7.10, we have 

^ p(B'(5^)n£(m*)nHm*)nSK)n^,(5)j < e~ sn \ 

S:\S\=s 

by Lemma 7.16, we have 

C'(S,5)nW(S,v) C (£(m*) n2(mf, 



116 GONZALO FIZ PONTIVEROS, SIMON GRIFFITHS, AND ROBERT MORRIS 

and by Lemma 7.19, we have 

P ( U V(S,5)n£(m*)nZ(m*)nQ(m*y) ^ n-^. 

^S:\S\=s ' 

This proves (154), and thus completes the proof of Proposition 7.1, and hence of Theorem 1.1. 

□ 

The deduction of Proposition 7.2 is similar. Recall from Definition 7.4 the definitions of 
the events 1(S, m), A(S, 5), B(S, 5), C(S, 5) and V(S, 5). 

Proof of Proposition 7.2. Set s = (y/2 + 7) y/n\ogn, and recall that, by (119), 

[J A{S, 5) U B{S, 5) U C(S, 5) U V{S, 5) D (J l(S,m*). 

SCV(G m ):\S\=s SCV(G m ):\S\=s 

Thus, noting that a(G m *) > a(G nA ), and writing ^(8,6) = A(S,5) U B(S,5) U C(S,5) U 
T>(S, 5), it will suffice to prove that 




Now, by Lemma 7.6, we have 

P(^,5)nQ(m*)) < n- 5s . 

S:\S\=s 

by Lemma 7.9, we have 

f(B(S,5)ny(m*)r]Q(m*)r]V(S,5) c ^ ^e- snS . 

S:\S\=s 

by Lemma 7.15, we have 

v(c{S,5)n£{m*)ny{m*)nZ{m*)nQ(m*)^ ^n- 5s . 

S:\S\=s 

and by Lemma 7.19, we have 

P ( U V(S,5)r\S(m*)r\Z(m*)r\Q(m*yj ^ n-^ 1 . 

^•S:\S\=s ' 

It follows that 

P((a(G n , A ) > s) n £(m*) n y(m*) H Z(m*) n Q(m*)) < e~^\ 
which completes the proof of Proposition 7.2, and hence of Theorem 1.2. □ 
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THE TRIANGLE-FREE PROCESS AND R(3,fc): APPENDIX 



GONZALO FIZ PONTIVEROS, SIMON GRIFFITHS, AND ROBERT MORRIS 



Abstract. This file contains some of the straightforward but technical calculations from 
the paper "The triangle-free process and R(3, fc)" which were omitted from the proof in 
^— I order not to distract from the main argument. We have gathered these calculations here in 

order to save the interested reader the trouble of reproving them herself. 

<N 

<D 

I. Introduction 

in 

This short file is an Appendix to the paper [2]. In that paper we follow the triangle- free 

, , process to its asymptotic end, and show that it is an excellent 'Ramsey graph', in the sense 

Q that it gives very strong lower bounds on the Ramsey numbers R(3, k). 

Recall from [2] that we denote by G n .A the (random) maximal triangle-free graph on 
{1, . . . ,n} obtained via the triangle-free process. The main results of [2] were as follows: 1 

+-> 

£h Theorem 1.1. 



with high probability as n — >■ oo. 

^ The Ramsey number i?(3, k) is the smallest integer n such that every red-blue colouring 

of the edges of the complete graph K n contains either a red K k or a blue triangle. 

(N) Theorem 1.2. 

8 R(3,k) 2 ^-o(l) A 1 



4 / log A; 



as k — > oo. 



^ The basic heuristic behind Theorems 1.1 and 1.2 is that, with high probability, the graph 

G m obtained after m steps of the triangle-free process approximates (in a certain sense) 
the Erdos-Renyi random graph G ntm , except in the fact that it contains no triangles. More 
precisely, there exists a (large) collection S of variables all of which take (approximately) 
the values one would expect in 67„ )m , and all of whose derivatives at time t — m- n~ 3//2 may 
be bounded by functions which depend only on the values of variables in S at time t. We 
showed that moreover almost all of these variables exhibit a certain 'self-correction', and 
were thus able to control their evolution with a fairly high degree of precision. 



Date: February 27, 2013. 

1 We remark that when we restate results from [2] we shall use the numbering of that paper, whereas new 
statements will be given the prefix 'A'. 
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In this file we shall give the details of various straightforward but technical calculations 
which were omitted from [2] due to considerations of space and aesthetics. More precisely: 

• In Section 2 we shall give an extended version of [2, Section 3.4], prove a generalized 
version of Lemma 3.9, and prove Lemma 4.23 and Proposition 5.5. 

• In Sections 3 and 4 we shall do the same for [2, Section 4]; in particular, we shall 
prove Propositions 4.55 and 4.56 in Section 4. 

• In Section 5 we shall derive the equations which govern the 'whirlpool' of [2, Sec- 
tion 6], and prove Lemma 6.7. 

• Finally, in Section 6, we shall (for completeness) adapt the proof (from [3]) of 
Lemma 3.1 to our setting. 



2. Section 3.2: Tracking the variables X e 

Recall that C = C(e) > is chosen sufficiently large, and that 

g y (t) = e^V^Qogn) 4 and 9x (t) = Cg y (t). 

Set a = u ■ n 3 / 2 and define, for each to G [to*], 

K x (m) = £{m) n x(a) n y{m) n Q(to). 

In this section we shall prove Lemmas 3.9 and 3.11 of [2], which were used in [2, Section 3] 
to prove the following proposition. 

Proposition 3.8. Let oo ■ n z l 2 < m < m* . With probability at least 1 — n - Glo & n either 
K x {m — l) c holds, or 

X e (m) G (liCe^V^Qogn) 4 ) • 2e" 8 ' 2 n = (l ± g x (t))X(m) (1) 

for every open edge e G 0(G m ). 

Let (W,A) denote the graph structure pair with v(W) = 4, va(W) = 1, e(W) = 1 and 
o(W) = 2. We shall use the following two immediate consequences of the event £(to): that 

X e (m) ^ 2 • X(m) 

for every open edge e G 0(G m ), and that 

N ( p{W){m) ^ max {4te~ 8t y/n, (logn)^} 

for 6V6ry co • 

n 3/2 < m ^ m * anc j ever y faithful map 0. 
The first step is to show that the variables X e are self-correcting as long as the event 
/C(to) = lC x {m) fl X{m) holds. Define 

Xjjn) - X{m) 

X p (m) = = , 

g x (t)X(m) 

the normalized error. We shall prove the following lemma. 
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Lemma 3.9. Let uj • n 3/2 < m < m* . If £(m) fl X(m) D y(m) n Q(m) /ioWs, £/ien 

E[AX:(m)] G -^(-x:(m)± e ) 

/or every e G 0(G m ). 
It is easy to see that 

E[AX e (m)] = £ (l7(m) + l), (2) 

Vl ; /£X e (m) 

for every e G 0(G m ). Indeed, for each open triangle T in G m containing e, the probability 
that one of the open edges (/ and h, say) of T other than e is closed (or chosen) in step 
m + 1 is equal to 

\Y f (m)UY h (m)\+2 _ Y f (m) + Y h (m) + 2 



<2(m) 

To see this, simply note that if Yf(m) fl Yh{m) ^ 0, then the endpoints of e have a common 
neighbour in G m , which means that e ^ 0(G m ), a contradiction. Since X e decreases by two 
for each open triangle which is destroyed, (2) follows. 

In order to deduce Lemma 3.9 from (2), we shall prove the following, more general state- 
ment. We also use this more general version in [2, Section 7.4]. 

Lemma A. 2.1. Let A(m) be a random variable which denotes both a collection of open edges 
of G m , and the size of that collection. Suppose that 



E[AA(m)] G £ Y f (m) (3) 



±o(l) 

feA(m) 

for some £ G N and every u ■ n 3 ^ 2 < m ^ m* , and set A{m) = e~ 4ft2 A(0) and 

A*{m) = A(m) T i(m) , 
g(t)A(m) 

for some g: (0,t*) — > 1R + which satisfies g(t) ^ g x (t) and g ~ g x ? Then 

E[AA*(m)} G -±L(-A*(m)±e). 

for every to ■ n 3 ^ 2 ^ m ^ m* such that y(m) fl Q(m) holds and \A*(m)\ ^ 1. 

We emphasize that £ is an absolute fixed constant; in fact, in our applications we shall 
need to consider only the cases £ = 1 and £ = 2. In the proof of Lemma A. 2.1, we shall use 
the product rule, which was stated in [2, Section 4] 



2 We write g ~ g x to indicate that g(t) = A(n) • g x (t) for some function X(n). Since g(t) ^ g x (t), we have 
A(n) > 1 for every neN. 
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The product rule. For any random variables a(m) and b{m), 

E[A(a(m)b(m))] = a(m)E[Ab(m)] + b(m)E[Aa(m)] +E[(Aa(m))(A6(m))]. 
In particular, if a(m) is deterministic, then 

E[A (a(m)b(m))] = a(m)E[Ab(m)] + Ao(m) (b(m) + E[A6(m)] ) . 

To simplify the calculations below, we shall write a pa b to denote that the inequalities 
a/b e l±0(l/n) hold. 

Proof of Lemma A. 2.1. Observe first that, differentiating with respect to t, we have 

AA(m) pa ■ A{m) and A{g(t)A(m)) pa - ^ • g(t)A(m). (4) 

We claim that if y(m) D Q(m) holds and |A*(m)| ^ 1, then 
E[AA(m)} -AA(m) e £ ^7( m ) + S • 



c 



(l - (1 ± 2g y (t)) (1 + ^)A-(m))) • (t ± o(l)) ■ ' A{m) 

"V / g(m) 

Indeed, this follows since g q (t) <C <?,,(£) ^ e z g x (t) ^ £ 3 g(t) and u < t ^ t*. Thus 

E[AA(m)] ~ Ai(m) 8£t et 

Now, since A(m) — A(m) = g(t)A(m) ■ A*(m), by the product rule we have 

E[AA(m)] - AA(m) A(g(t)A(m)) , ,^r A ,*, ,\ ^ 

= E[AA (m) + M (m) + E M (m) ). (6) 

#(t)A(m) #(t)A(m) V / 

Combining (4), (5) and (6), we obtain 

E[AA (m)] e - • + ■ A*(m) ± 



n 
At 

C 



A*(m)±e), 



n 3/2 

as required. □ 
Recall next the following bound on \AX e {m)\, which was proved in [2]. 
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Lemma 3.10. Let u ■ v?l 2 < m ^ m* . If £(m) holds, then 

|AX e (m)| ^ max{Ate' st2 ^,(\ogn) w } 

for every e G 0{G m ). 

Using Lemmas 3.9 and 3.10, we can bound \AX*(m)\ and E[|AX*(ra)|]. 
Lemma 3.11. Let u ■ n 3/2 < m < m* . If £(m) n X(m) D y(m) n Q(m) holds, then 

\AX* e (m)\ ^ — max {te^ 2 ^, (log nf} and E[|AX e *(m)|] < ■ ^ 

g x (t) n g x (t) n > 

for every e G 0(G m ). 

In order to deduce Lemma 3.11 from Lemma 3.10, and also in Section 5, below, we shall 
use the following result, which was stated (but not proved) in [2]. 

Lemma 4.23. Let A{m) be a random variable, let A(m) and g(t) be functions, and set 

A*(m) = A(m) T i(m) . 

g(t)A(m) 

If\A{m)\ < (l + g(t))A(m), 

\AA(m)\ < • A(m) and \ A(g(t)A(m)) | « ^ • g(t)A(m), (7) 

then 

1 1 )l " \g(t)A(m) g(t) n*/*J 
Proof. Observe first that, for arbitrary functions a, b, c: N ->■ R+, if a*(m) = a(r " c ) ( ~ b ) (m) then 
Aa(m) — Ab(m) = A[a* {m)c{m)) = c(m + l)Aa*(ra) + a*(m)Ac(m), 

and hence 

Aa(m) |A6(m)| • c(m) + (a(m) + 6(m))|Ac(m)| 

Aa (to) G — — ± -, — r -, — . (8) 

c(to+1) c(to)-c(to + 1) 

Applying (8) to the functions A(m), A(m) and g(t)A(m), and using the assumptions that 
A{m) < (1 + g(t))A(m) and 

|Ai(m)| « ^-i(m) and |A(^)i(m)) | « ^ • ^)i(m), 



we obtain 

Vs(i)^(m) 9(f) " 3/2 7 
as claimed. □ 

Specializing to the case X e , we obtain the following easy corollary. 



Lemma A. 2. 2. For every u ■ n 3 ^ 2 < m ^ m* , if X(m) holds, then 



|Ax; M l^-f |AXe(m)l + l0gn 



g x (t) V X(m) n 3 / 2 



for every e G 0(G m ). 



Proof. We apply Lemma 4.23 with A(m) = X e {m) and g(t) = g x (t). Since X(m) is equal to 
e~ 8 * times a function of n, and g x (t)X(m) is equal to e~ 6 * times a function of n, we have 

a ,~w x — 16£±o(l) ~, N . / , s ~, ,n — 12i ± o(l) , s ~ , s 
AX(m) G • X(m) and A(^(t)X(m)) G • g x (t)X(m), 

and hence 

\AX(m)\ « l ^-X(m) and |A(^(t)X(m)) | « ^ ■ g x (t)X(m). 

Moreover, the event X{m) implies that X e (m) ^ (l + g x (t))X(m) for every e G 0(G m ), and 
so 

, * */ m 2 /|AX e (m)| logn 
AX T m ^ — - • 1 ~ el /' + 



g x (t) V X(m) n 3 / 2 
as claimed. □ 

We can now easily deduce Lemma 3.11. 

Proof of Lemma 3.11. By Lemmas 3.10 and A. 2. 2, we have 

iAx: M i^^ |AXe(m)l + logn 



g x (t) V X(m) n 3/2 
< 1 (max{4te~ 8 * 2 v^,(logn) w } + l ° gU 



g x (t)X(m) I v" 



D 8t 2 



C e 8 

< ttv • max {te~ 8t ^, (logn) w }, 

9x\t) n 

as claimed. Moreover, observe that, by (2), and using the event A'(m) fl 3^(m) D Q(m) and 
the fact that X e is decreasing, we have 

fex e (m) ^\"<>) 
Thus, by Lemma A. 2. 2, we have 

2 / 12* logra\ C logn 



9x(t) W /2 n3/2 J 9x(t) n 3 / 2 ' 
as required. □ 
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2.2. The proof of Proposition 5.5. We take this opportunity to prove a similar proposi- 
tion from [2, Section 5]. 

Proposition 5.5. Let uo ■ n 3 / 2 < to ^ m* . Then, with probability at least 1 — n ~ clogn , either 
lC y {m — l) c holds, or 

Y e L (m) G (l±^(t))-(2te- 4 ' 2 v^) (9) 

for every e G 0(G m ). 

The proof is almost identical to that of Proposition 3.8, above, so we shall skip some of 
the details. Recall first the following special case of [2, Lemma 5.20], which is obtained from 
the version stated there by setting a = L and noting that U^(m)V^{m) = X]/eY L (m.) ^/( m )- 

Lemma 5.20. Let u ■ n 3 / 2 < m ^ m* . If £{m) fl U{m) fl X{m) fl Z{m) holds, then 

for every e G 0(G m ). 
Define 

(Ly(m) = 2_^ H -Y M 
9 ,(t)Y(m) 

the normalized error. Since X(m) <C Y(m) 2 for t > u, it follows from Lemmas 5.20 and A. 2.1 
that the variables F e L are self-correcting. 

Lemma A. 2.3. Let u -n 3/2 < to < m* . If £{m) D W(m) D A"(to) n Z(m) n Q(to) /io/ds, tfien 

E[A(l?)*(ro)] G ^(-(y e L rN± £ ) 

/or every e G 0{G m ). 

Next, recall the following lemma from [2, Section 5.3]. 
Lemma 5.17. Let u ■ n 3 / 2 < to ^ m* . If £(m) fl Z(m) holds, then 

| AF e L (to) I < (logn) 3 

for every e G 0{G m ). 

Using Lemmas 5.17 and A. 2. 3, we can bound |A(F e L )*(m)| and E[|A(F e L )*(m)|] . 
Lemma A. 2. 4. Let u ■ n 3/2 < m < m* . If £{m) C\U(m) D X(m) H Z(m) H Q(m) holds, then 

/or every e G 0{G m ). 

In order to deduce Lemma A. 2. 4 from Lemma 5.17, we shall use the following easy conse- 
quence of Lemma 4.23. 



Lemma A. 2. 5. For every u ■ n 3 / 2 < m ^ m* , ifU(m) holds, then 

|A(nYMI^4.-f^ M + l0gn 



g x (t) V Y(m) n 3/2 
for every e e 0(G m ). 

Proof. We apply Lemma 4.23 with A{m) = Y e L (m) and g(t) = g x (t). Since Y{m) is equal to 
t • e~ At times a function of n, and g x (t)Y(m) is equal to t ■ e~ 2t times a function of n, we 
have 

Ay(m)€ ~ 8t ^ ° (1) .y(m) and A(^(t)F(m)) g ~ 4t * .fl,(t)y(m), 
and hence 

l Af »l « and l A MW™))l « ^ ■s.PH. 

Moreover, the event W(m) implies that 2 • Y e L (m) < (l + g x (t))Y(m) for every e G 0(G m ), 
and so 



as claimed. □ 

We can now easily deduce Lemma A. 2. 4. 
Proof of Lemma A. 2. 4- By Lemmas 5.17 and A. 2. 5, we have 

rL N| ^ 4 /|AY" e L (m)| logn^ ^ C (logn) 3 



I A K )*("») < ^ ■ ^™ + < ^ , 

as claimed. Moreover, since 

E[\AY e L (m)\] < y(m) < logn 

f(m) ^ Q(m) ^ ™ 3/2 ' 
it follows from Lemma A. 2. 5 that 

j Wlj ^ Ut) y fM + n3/2 j ^ ^ n3/2 , 

as required. □ 

Finally, let's deduce Proposition 5.5. 

Proof of Proposition 5.5. We begin by choosing a family of parameters as in [2, Defini- 
tion 3.4]. Set K{m) = ^(m)nW(ra) n {|(F e L )*(a)| < 1/2} and I = [a,b] = [u ■ n 3/2 , m*], 
and let 

C (logn) 3 , n . . C logn 

V ; g x (t) v 7 ^ n 3 / 2 
Moreover, set A = C, 5 = e and h(t) = t ■ n~ 3//2 . We claim that (A, 5; g x , h; a, (3; /C) is a 
reasonable collection, and that F e L satisfies the conditions of [2, Lemma 3.2] if e e 0(G m ). 



To prove the first statement, we need to show that a and (3 are A-slow, and that 

et 



min {«(£), /3(t), h(t)} ^ 



n 



3/2 



and a(t) ^ e 2 for every u < t ^t*, each of which is obvious, since g x {t) ^ 1 for all t ^ t*. To 
prove the second, we need to show that Y e L is (g x , h; /C)-self-correcting, which follows from 
Lemma A. 2. 3, that, for every to ■ n 3 / 2 < to < to*, if /C(to) holds then 

|A(F e L )*(m)| < a(t) and E[|A(r e L )*(m)|l < /3(t), 



which follows from Lemma A. 2. 4, and that |(F e L )*(a)| < 1/2, which follows from /C(m). 
Observe that 

/ \ n , x s/9 C 2 (logn) 4 1 

0*(t) 2 V™ (lognf 
for every u < t ^.t*. By [2, Lemma 3.1], and summing over edges e G E(K n ) the probability 
that e G 0(G m ) and (F e L )*(m) > 1, it follows that 

p(w(to) c n /C(to - 1) for some to G [a, 6]) < n 6 exp ( - 5'(logn) 3 ) < n- clogn . 

Finally, we remark that 

P(/C y (a) n {|(n L )*(a)l > 1/2}) < n~ clog ", 
by Proposition 4.56 (see Section 4, below), and so the proposition follows. □ 

3. Section 4: Everything Else 

In this section we shall give the details omitted from [2, Section 4], recall from some of 
the lemmas from that section which we shall need below, and prove some simple variants. 
Recall the following important definitions from [2]. 

Definition 2.10. Define 

t* A (F) = inf |t > : N A (F)(m) < (2t) e(F) } G [0, oo] (10) 

and 

t A (F) = min | min {t* A (H) : A C H C F}, t* }. 
We call £a(-F) the tracking time of the pair (F,A). 

Moreover, for each graph structure pair (F, A) with £a(F) > 0, define 

c = c(F,A) : = max! max <j- — 2 °[ H ^ /rr . \, 2 I, (11) 
V ; \ achcf \ 2va(H) — e(H) J J ' v ; 

so in particular e c * 2 = n 1 / 4 when t = t A (F) if *a(F) < t*. 
We begin with the following crucial remark. 

Remark 4.2. Let (F, A) be a graph structure pair with v(F) = n°^\ and let to G [to*]. 
If the event £(m) holds, then N < j } (F)(m) satisfies the conclusion (i.e., either (a), (b) or (c)) 
of [2, Theorem 4.1], for every faithful injection <p: A — > V(G m ). 
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Proof of Remark 4-2. Let us denote by (F', A', <p') the graph structure obtained by removing 
the isolated vertices from F, so A' = V(F') n A and 0' = <j>\ A ,. Note that t A >(F') = t A (F), 
since t* A (H' U A) = t* A ,(H') for every A' C H' C F'. Assume that the event £(m) holds and 
that is faithful at time t. We shall consider in turn the three cases corresponding to parts 
(a), (b) and (c) of the theorem. 

Suppose first that < t < cu < t A (F), and note that 

NpiF'Xm) G N A/ (F')(m)±f F/ , A/ (t)NA'(F')(n 3 / 2 ), 

since the event £{m) holds, and t A >(F') = t A (F). Now, each copy of F' in G m , rooted 
at (j)'(A'), extends to between (n — v(F)) k and n k copies of F rooted at 4>{A). Set k = 
v A (F) - v A ,(F'), and note that N A (F) = n k ■ N A >(F') and that 7 (F, A) > 7 (F', A') if k ^ 0. 
It follows that 

jV (F)(m) G A> A (F)(m)±/ F , A (t)A> A (F)(n 3 / 2 ), 

as required. 

The proof in case (b) is almost identical, so we skip the details and proceed to case (c). 
Here we need the observation that the minimal A C if C F such that t < tn{F) contains 
no isolated vertices. Indeed, this is obvious, since moving an isolated vertex from H to F 
can only increase iff(F), and t > t A (F). It follows that the minimal A C if C F such that 
i < t H (F), is equal to A union the minimal A' C H' C F' such that t < t H >(F'). Since 
A(F', if', A') < A(F, if, A) unless k = 0, it follows that 

A^(F)(m) ^ n fe -(logn) A(F '' H '' A ' ) iVH'(F , )(m + ) ^ (logn) A ( F ^iV^(F)(m + ), 

by the event S(m), as required. □ 

We remark that the same argument implies that the building sequences 

A C H C • • • C H e = F and A' C ff£ C • • • C H[ = F' 

of (F, A) and (F', A') respectively are identical, in the sense that Hj — if j U A for every 
^ j ^ I — 1. That is, all of the isolated vertices of F lie in V(F) \ if^_i. 

We next recall three simple properties of the collection J 7 ^, see [2, Section 4.2]. 

Observation 4.20. Let (F, A) be a graph structure pair. Then \Fp\ = e(F), and 

2te 4 * 2 • N A (F°)(m) = ^fa ■ N A (F)(m) 

for every F° G T F . 

Observation 4.24. If (F, A) is a graph structure pair and F° G Tp, then t A (F) < t A (F°). 

Proof. If t A (F°) = t* there is nothing to prove, so suppose that A C H° C F° satisfies 
^(ff°) = ^(F°) < t*. Then either ff° C F, in which case t A (F) ^ ^(ff°) = ^(F°), as 
required, or if ° G J 7 ^ for some A C- H <Z F . 

We claim that, in the latter case, we have t A (F) ^ f^(ff) ^ ^(ff°) = t^ )- Note that 
e(if) = e(ff°) — 1, and therefore, by Observation 4.20, 

7V^(if )(m) = ^-^(if)(m) > ^-iV A (if)(m) > (a)^" 1 = (2t)^ 
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for every t ^ t* A (H) < t*. By (10), it follows that t* A (H) ^ t* A (H°), as required. □ 

Observation 4.25. Let (F, A) be a graph structure pair with v A (F) ^ 1, and let F° G T F . 

Then 

( F )9y(t) < 9fAP) and 9F°A f ) < QfA 1 ) 

as n — )> oo. 

Proof. Observe first that c(F, A) ^ c(F°, A) > 2 for every F° G F£, by Observation 4.24. 
Indeed, c = c(F, A) was chosen (see (11)) so that e ct = n 1 / 4 at t = t A (F), and t A (F) ^ 
t A (F°). Now, noting that u A (F) = v A (F°), e(F) = e(F°) + 1 and o(F) = o(F°) - 1, we have 

7 (F,A) ^ 7 (F°,A) + V r 7 (F,A), 

by the definition of 7(F, A). □ 

Recall next that 

f FtA (t) = e ^(^)+l)(* 2 +l) n -l/4 (logn) A(F,A)- v ^I) (12) 

for each graph structure pair (F, A), and that 

/„(*) = e ct2 n' l /\\ognf' 2 and / x (f) = e^/^)- 
The following variant of [2, Observation 4.25] follows easily from the definitions. 

Observation A. 3.1. Let (F, A) be a graph structure pair with v A (F) ^ I, and let F° G T F . 

Then 

(logny^+< F \f y (t) + f FO , A (t)) « f F>A (t) 

as n — )> oo. 

Proof. Since e(F°) = e(F) - 1 and o(F°) = o(F) + 1, we have 

A(F,A) = (5(F°,A) + 1) C > A(F°,A) + ^A(F,A). 
The claimed bound now follows immediately. □ 

Our next few results give useful properties of the collection of graph structures T FA which 
was defined in [2, Section 4.3]. The first four were also proved there. 

Observation 4.32. Tp~ A C T FA for every graph structure pair (F,A). 

Note that, by this observation, the following results all also hold for each (F', A') G Tp A . 

Observation 4.34. Let (F, A) be a graph structure pair, and let (F',A') G F^ A . For every 
A' C H' C F' , we have N H ,(F') < N WnF (F). 

Observation 4.37. Let (F, A) be a graph structure pair, and let (F',A') G T FA . Then 

A(F',H',A') < A(F'-u,A) ^ A(F, A) - 3a/A(F, A) 
for every A' C F' C F'. Moreover, the same bounds holds if H' = F' and A' H F ^ A. 
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Observation 4.39. Let (F, A) be a graph structure pair, and let (F',A!) G T F A . Then 

A(F',A') ^ (l + e)A(F,A). 

The following lemma was stated but not proved in [2]. 

Lemma 4.41. Let (F,A) be a graph structure pair with tA(F) > 0, and let (F',A') G F FA . 
//(logn)T^) < n v A (F)+e(F)+i ) then 



max { A(F', A') - A(F, A), y/A{F,A)} 
Proof. We have, as in the proof above, 



e 2 log n 
v A (F) log log n 



\A(F', A') - A(F, A) | + y/A{F,A) ^ (6{F, A) + 2) - 5(F, A) c + 5(F, A) c ' 2 



^C5(F,Af-^ AC - A ^ - 4 A ^ 



S(F,A) 



C v A (F) 2 



Note that e(F) ^ 2v A {F) - 1, since t A (F) > 0. Since (lognp^ ^ n v A {F) + e{F)+i ^ it foUows 
that 

v A (F)+e(F) + l -y(F,A) loglogn A(F, A) loglogn 
va{^ ) ^ ~ ^ • 



It follows that 

\A(F',A')-A(F,A)\ + ^/AjF^j ^ 

as claimed. 



3 logn 
16 



log n e 



logn 
logn 



C ■ v a(F) loglogn ^ va(F) loglogn' 
We next prove three new lemmas about the collection T FA . Recall Table 4.1. 



□ 





(a) 


(b) 


(c) 


(d) 


(e) 


(/) 


v A -(F-)-v A (F) 





-1 





-2 


-1 


-1 


e(F-) - e(F) 


1 


«S 


1 


< o 


< 


< 1 


o(F-) - o(F) 





«S 





< o 


< o 





Table 4.1 

Lemma A. 3. 2. Let (F, A) be a graph structure pair with t A (F) > 0, and let (F f , A') G F FA . 
If (logn)^ F ' A ) < n v A (F)+e(F)+i and t ^ Uj then 

f F , tA ,(t) < min{n £ , (logn) £A ( F ' A )} • (logn) A ^-V^). 
Proof This follows from the definition (12), using Observation 4.39 and Lemma 4.41. □ 
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Lemma A. 3. 3. Let (F, A) be a graph structure pair with tA(F) > 0. // (F',A f ) G T FA , 
then 

N A ,{F'){n" 2 ) < ■ N A {F){n^). (13) 

\/n 



Proof. Note that N A (F)(n^ 2 ) = 2 e ^ e - 4o ^n VA ^- e( - F ^ 2 for every pair (F, A), since t = 1 
when m = n 3 ^ 2 , and that e(F') ^ e(F) + 1. We are thus required to prove that 

v A ,{F') - v A (F) < e(jP/) ~ 2 e(F) ~ 1 . (14) 

In cases (a) and (c), this follows immediately from Table 4.1. In cases (b), (d), (e) and (/) 
we need the following extra observation: tA(F) > implies that t* A (A' fl F) > 0. This gives 

e(F) - e(F) ^ e(A' n F) < 2v A (A' n F) — 1 = 2(t» A (F) - v A ,(F)) - 1, 

as required. □ 

Lemma A. 3. 4. Let (F,A,(f>) be a graph structure triple, and let (F',A') G Fp A . Suppose 
that tA'(F') < t ^ uj < t A (F), and that <fi' : A' — > V{G rn ) is faithful at time t. If £{m) holds 
and {\ogn) A ^ ^ n 2e ^ +2 , then 

N^F'){m) < -= N A {F){n 3 ' 2 ). (15) 

Proof. Note that since t A >(F') < t ^ uj, it follows that t A /(F') = 0. There are two cases: 
either (F',A') is balanced, or it is not. In the former case, since M.(m) holds we have 



AV(F')(m) < (logn) A ( F '-^ A ') < (lognf^^v 7 ^), 
and so in this case we are done. In the latter case, since S(m) holds we have 

AV(F')(m) < (\ogn)^ F '' H '^N H/ (F')(m + ), (16) 
where A' C H' C F' is minimal such that t < t H i(F'), and m + = max{m, n 3//2 }. Now 

(logn) A ( F '^'' A ')iV^(F / )(m + ) < (logn) A ^- 3 v / ^^jV H (F)(m+), 

where H = H' (~) F, hy Observations 4.34 and 4.37. Next, note that, since t A (F) > 0, we 
have t* A (H) > 0, and thus e(H) ^ 2v A (H) - 1. Hence 

N A (H)(n 3 / 2 ) > 2 e ( F ) e - 4 °Wv^. 
Since < t < it follows that 

, e(F) 80(F) 

iV„(F)(m + ) < co e ^e 4 °^ ■ N H {F){n 3 ' 2 ) ^ ^= • N A (F)(n 3 ^ 2 ), 

y/n 

and thus 

(lozn) A(F ' A) - 2 ^ A(F > A) ~ 

N^F'){m) < = iV A (F)(n 3 / 2 ), 

A/n 

as claimed. □ 
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4. Section 4.6: the land before time t = u 

In this section we shall use the method of Bohman [1] to control the variables N^F) up 
to time t = u, assuming that t A (F) > 0. Recall once again that 

f FtA (t) = e c ^)+ 1 )( t2 + 1 )n- 1 / 4 (logn) A ^ A )-v / ^) 

for each graph structure pair (F, A), that 

f y (t) = e&n-V'Qogn) 5 ' 2 and f x (t) = e'^ f y (t), 

and that )C £ (m) = y(m) n Z(m) fl Q(m). We shall prove the following proposition. 

Proposition 4.55. Let (F,A) be a graph structure pair, and let < t ^ u < t A (F). Then, 
with probability at least 1 — 7i _31ogn ; either (£(m — 1) fl M.(m — 1) fl )C £ (m — l)) c holds, or 

N^F){m) G N A (F)(m) ± f FjA (t) ■ N A (F)(n 3/2 ) (17) 
for every (f>: A — > V{G m ) which is faithful at time t. 

We remark, for future reference, that (17) holds trivially if (\ogn) A( - F ' A ^ > n e( - F ^ +2 , since 
then f FtA (t) ■ N A (F)(n 3/2 ) > n" A ^ F \ and that we have e(F) < 2v A (F), since t A (F) > 0. 

The proof of Proposition 4.55 relies heavily on the fact that the event y(m) gives us 
stronger bounds (in the range t ^ u>) on the variables Y e than those given by the event 
£{m). The bounds we need are essentially due to Bohman [1], although he stated a slightly 
weaker version of the following proposition. For completeness, we shall sketch the proof. 

Recall from [2, Section 5] the definition of the variables Y^{m). We shall need the following 
slight strengthening of [2, Proposition 4.56] in [2, Section 5] in order to show that the event 
14(a) holds, where a — u ■ n 3 / 2 . 

Proposition 4.56 (Bohman [1]). Let m < u ■ n 3//2 . With probability at least 1 — n ~ cl ° en , 
either (Z(m — 1) fl Q(m — 1))° holds, or 

X e (m) G X(m) ± f x (t)X(n 3/2 ) and 2 • Y e L (m) G Y(m) ± f y (t)Y(n 3/2 ) (18) 

for every e G 0(G m ). 

Recall the following martingale lemma from [2, Section 3]. 

Lemma 3.2. Let M be a super-martingale, defined on [0,s], such that 

-(5 < AM (to) sC a 

for every m G [0, s — 1]. Then, for every ^ x ^ min{a, (3} ■ s, 

F(M(s) > M(0)+x) ^ exp ^-JL^j . 

Proof of Proposition 4-56. Consider the event that m ^ cu ■ n 3 / 2 is the minimal value of to 
such that the event Z(m) fl Q(m) holds, and moreover either 

X(m) c holds, or 2 • Y e L (m) £Y(m)± f y (t)Y(n 3/2 ). 

We shall consider the two subcases: X(m ) holds, or it does not, separately. 
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Suppose first that X(m ) c holds, and note that X(m) = 6(e~ 8 ' 2 • X(n 3 / 2 )) and Y(m) = 
e(t ■ e' 4t ' 2 ■ Y(n 3/2 )). By (2), and using the event X(m) D y(m) n Q(m), which holds for all 
m < mo, we have 

E[AX e (m)} = 



Q{m) 



fex e (m) 



lie- /,(t)e 4 ' 2 ) (X(m) ± / x (f)X(n 3 / 2 )) (y(m) ± /,(t)f(n 3 / 2 )) . 



Q(m) 

Multiplying out the brackets, we obtain 

X{m)Y{m) ± 0(ete- 8t2 f y (t) + te-*7 x (t) + f v (t) + /,(t)/,(t))x(n 3 / 2 )F(n 3 / 2 ), 
and hence, recalling that f y (t) = e 4 * 2 f x (t) <C e~ 8w2 , we obtain 



E[MeH] e 



16* 

n 3/2 



It follows that 



Mf(m') 



m'-l 

E 

m=0 



■X(m) ± VC- 



16t 



t+ 1 

n 3/2 



f x (t)X(n 3 ' 2 ). 



AX e (m) + — -X(m) ± VC 

n ' 



t + 1 



TV 



3/2 



/x(*)*(n 



3/2 \ 



is a super- /sub- martingale pair on ^ m' < m . Moreover, the number of open triangles 
which are destroyed in step m + 1 of the triangle-free process is at most 

max F e (m) ^ \pn 

eeO(G m ) 

since iV(m) holds for every m < m Q . It follows that — yfn ^ AX e (m) ^ 0, and hence 

C 



- 2yjn <: AM| e (m) ^ 

for every e G 0(G m ), while [V(m) holds. Now, set 
C , /,(a;)X(n 3 / 2 ) 



n 



(19) 



+ 



and (3 = 2\fn + 



/ x (u;)X(n 3 / 2 ) 



n mo mo 

and observe that, since f x (t)X(n 3 ^ 2 ) ^ n 3 / 4 (logn) 5//2 and we may assume 3 that m ^ n 1 / 4 , 

(/.(t)X(n 3 / 2 )) 2 



m < 

Hence, applying Lemma 3.2, we obtain 



(log n) 



p((M- e (m ) > ^(t)X(n 3 / 2 )) n/C(m- 1)) < e^") 3 , 



3 If m sC n 1 / 4 , then (19) implies that X e (m ) >n- <3(n 3 / 4 ), and hence X(m ) U y(m - If holds. 
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and similarly for Mj£ . Finally, noting that 



1 



m'-l 



m'-l 



lib _1_ -i lit X -. 

J2^t-X(m)e (l-e- 8t ' 2 )n±l and — (t + 1) f x (t) ^ - ■ f x (t') , 

m=0 m=0 

it follows that, with probability at least have 

m'-l 



X e (m) 6 n - )^ 



16* 

3/2 



X(m) ± VC 



m=0 L 

G X(m') ± ^(t')X(n 3 / 2 : 



t + 1 

n 3/2 



f x (t)X(n 3 / 2 ) 



± ^(t')X(n 3 / 2 ) 



for every e G 0(G m ), as required. 

The proof in the case that X(mo) holds is similar. The first step is to break AY e L (m) into 
two pieces, Cj(m) and D^(m), where Cj(m) denotes the number of edges of F e L (m + 1) 
which were created in step m + 1 of the triangle- free process, and D^{m) denotes the number 
of edges of Y e L (m) which were closed in that step. It follows immediately from the definition 
that 

m! — 1 



nV) = E (Cl(m)-Dl(m)). 



(20) 



m=0 



We claim first that 



^ n y ( 1 1 X e {m) {l±0(e)e^f y (t))(X(m)±f x (t)X(n^)) 

]U.|G e (m) I (j m \ = - ■ — - G 



2 Q(m) 



2 • Q(m) 



c 2e-^ 2 n ± (e 4f2 / a (t) + Q(e)/ y (t))l(n 3 / 2 ) c 2e^ 2 ± f y (t) 



where we used the event X(m) fl Q(m), and the facts that f y (t) = e f x (t), and that 
X(m) = 6(e- 8 * 2 • X(n 3 / 2 )) > / ;E (t)X(n 3 / 2 ) for t s$ w. Similarly, we have 



E[Dl(m)\G m ] = E y /M e 



(1 ± 0(e)e* f v (t)) (y(m) ± /„(f)y(n 3/2 )) : 



2 • Q(m) 



C 



C 



±0(e. fV*7 s (f) + te -- 2 /,(t) + /,(t) 2 ) 
2 • Q(m) v ' Q{m) 

m\-^±{e + o{t))f y {t) 



n 



using the event y{m) fl Q(m), and the fact that Y(m) = Q(t ■ e 4t ■ Y(n 3/2 )). Hence 



m'-l 



MUm>) = £ 



2e-«-±/ w (t)' 



m=0 
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and 



D e (m) 



m=0 



n 



are both super-/sub-martingale pairs while the event X{m) fl y(m) D Q(m) holds. 
Next, observe that 

< Cj(m) ^ 1 and < D^(m) «C (logn) 2 , 

while Z(m) holds, and hence 



C 



n 



sC AMj y (m) sC 2 and 



C 



< AMj y (m) < 2 (logn) 2 . 



Now, set 



a = 2(logn) 2 + 



m 



and ^£ + 4M». 
n m 



and observe that, since f y (t)Y(n 3 ^ 2 ) ^ n 1 / 4 (logn) 5 / 2 and we may assume that m ^ n 1 / 4 , 

(/,(t)r(n^)) 2 



a ■ (3 ■ m ^ 
Hence, applying Lemma 3.2, we obtain 



uj • (logn)' 



P( ( M- Y (m] 



> ^f y (t)Y(n 3 / 2 )^ n K{m - 1)^ < n- 2Clogn , 



and similarly for M+y, M cY and M^ y . 

Finally, note that £ f (m) = (4 - 32t 2 ) e" 4 ' 2 ^ and ^ E^f 2 + l)/y(*) < b ' 
Hence, with probability at least 1 — n~ 2Clogn , we have 

m'—l 

2.Y e (m') = 2-J2 {c Y e {m)-D Y e {mj) 



m=0 
m'-l 

* E 

m=0 



(4 - 32* 2 ) e - 4t \ OC* 2 + !)/»(*) 



n 



n 



± 2/«(0^(n 3/2 ) 



G Y(m') ± / s (OY-(n 3 / 2 ), 
as required. Putting the pieces together, it follows that 

f( (J (AT(m) n y(m)) c n Z(m) n Q(m)j < n- clogn , 
and moreover the same holds if we replace y(m) by the event in (18). 



□ 
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4.1. Proof of Proposition 4.55. Returning to the proof of Proposition 4.55, let us fix a 
graph structure triple (F,A,4>) with t A (F) > 0. The first step in the proof is to break up 
N^F) as follows: 

m'-l 

A^(F)(m') = <C,(F)(m) - D*(F)(mj), (21) 

where C^{F){m) denotes the number of copies of F rooted at <fi{A) which are created at step 
m + 1 of the triangle-free process, and £>^(F)(m) denotes the number of such copies which 
are destroyed in that step. Let's deal first with C^(F). It was proved in [2, Lemma 4.21] 
that 

E[C^F)(m)\G m ] WW (22) 

for every <fi which is faithful at time t. Using the events S(m) and Q(m), we easily obtain a 
super- /sub-martingale pair. 

Lemma 4.58. Let (F, A, <fi) be a graph structure triple, and suppose that <fi is faithful at 
time t, where < t ^ ui < t A (F). If £{m) fl Z{m) n Q(m) holds, then 

"WW I «.] - e(F) i A ^ )( '" > e ± M^K^) . (23) 

Proof. This follows from (22) via a straightforward calculation. Note first that the lemma 
holds trivially if e(F) = 0, since then Q,(F)(m) = for every m G N. So assume that 
e(F) > 0, and recall that the event Q(m) implies that Q(m) G (l ± e ■ e u f y (t))Q(m), and 
that the event £{m) (and the fact that t < u < t A (F) < t A (F°), by Observation 4.24) 
implies that 

iVF°)(m) G iV A (F°)(m) ± /^(t) • N A {F°){n^ 2 ). 

Moreover 2te 4 * 2 • 7V A (F°)(m) = y/n ■ N A (F)(m) and |F£| = e(F), by Observation 4.20, and 
hence, by (22), 

E[C,(F)(m) | G m ] G 1 ± ^ f f ) (UF)(m) e -^^ ± £ /^(0^(*> 3/2 ) • ^ 
Thus the left-hand side of (23) lies in the interval 

± 7^ ■ fv®**W( m ) + E /^)^(F)(n 3 / 2 ) ■ ^ 

<2(m) V 2t F ^ e 

Since 7V A (F)(m) = t e ( i? )e~ 4o ( F ^' 2_1 )iV4(F)(n 3 / 2 ), this is in turn contained in 

as required, where the final inequality follows since e(F) > and 

(log n)^)+°^ (/,(*) + /^(f)) « f F ,A(t) 
by Observation A. 3.1. □ 
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We shall also need a corresponding lemma for the variables D^F). It was proved in [2, 
Lemma 4.21] that, if Z(m) holds and (ft is faithful at time t, then 

R[D*(F)(m)\G m ] e -L7 E F /M ± o(F) 2 (logn) 2 A^(F)\ (24) 

We shall use the events £(m), y{m) and Q(m) to obtain a super-/sub- martingale pair. 

Lemma 4.59. Let (F, A, (ft) be a graph structure triple, and suppose that < t ^ ui < t A (F), 
and that (ft is faithful at time t. If £{m) fl y(m) fl Z(m) fl <2(m) /ioWs, £/ien 

4WMIC]- 8 '-^ ww 6 ± c. w ; ( f + i) ^ (f) ^ (F)(n3/2) (25) 

Proof. Note first that if o(F) = then the result holds trivially, since in that case copies of 
F cannot be destroyed, and so D^,(F)(m) = for every m e N. So assume that o(F) > 
and recall that, since £(m) fl y(m) fl Q(m) holds and t ^ u < t A (F), we have 

Q{m) e (1 ± 5 • e 4t 7 y (*))0(m), 17(m) G F(m) ± f y (t)Y(n 3 / 2 ) 

and 

N+(F){m) E N A {F){m) ± f FA {t) ■ N A {F){n 3 ' 2 ). 
By (24), it follows that Ej\D^(F)(m) | G m ] is contained in the interval 



l±e-e^f y (t) 



o(F) ■ (N A (F)(m) ± f F At)N A {F){n^)) (y(m) ± 2/,(t)F(n 3 / 2 )) , 



Q(m) 

where we used our assumption 4 that o(F) ^ n o(1 ) ^ n 1 / 4 ^ f y (t)Y(n 3 ^ 2 ) to absorb the final 
error term in (24). It follows that the left-hand side of (25) is contained in 

± o(F) ■ Y(n 3 / 2 ) 
Q(m) 

since Y[m) = 0(t) ■ e~ 4f2 ■ Y(n 3 ^ 2 ). This in turn is a subset of 



(t- f y (t) + 2f y (t))N A (F)(m)+(0(t).e- u2 + 2/ s/ (t))/^(t)iV A (F)(n 3 / 2 )Y 



± C ■ o(F) 



(t< F \t + l)e^ f y (t) + {t + e^f y (t))f FA (t))-N A (F)(n 3 l 2 ). 



n 3/2 

since N A (F)(m) = N A {F){n 3 / 2 ) ■ fe(F) e -4o(F)(t*-i) and ( F ) > Q 

Finally, note that e 4t ' 2 f y (t) < 1 for every t ^ w, and recall that (log n )< F)+o(F) f y (t) < 
fF,A(t), by Observation A. 3.1. It follows that 

E [0 ^)W| Gm ]- 8i ""^W"") e ± ^W). /F .( ( )^ W( „3/ 2) , 

as required. □ 

In order to use Lemma 3.2, we shall need bounds on C<p(F)(m) and D^(F)(m) which hold 
deterministically for all < t ^ u>. We shall prove the following bounds. 



4 Recall that Proposition 4.55 is trivial if o(F) ^ (logn) 1 / 5 . 
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Lemma 4.60. Let (F, A, <p) be a graph structure triple, and suppose that < t ^ ui < tA^F), 
and that <fi is faithful at time t. If £{m) fl M.{m) holds, then 

r «i (\o<yn) A(F ' A) - 2 V A ( F > A ) _ 
^ C${F){m) ^ min < n e , (logn) eA ^ . Na{F)(ti ' ). (26) 



/n 

Moreover, the same bounds also hold for D^{F){m). 

The required bounds on C0(F)(m) follow easily from Lemmas A. 3. 2, A. 3. 3 and A. 3. 4. 
Proof of Lemma 4-60. Recall first that, by [2, Lemma 4.28], we have 

CV(F)(m) < V max N^{F'){m). 

(F',A')eT+ A 

There are two cases to consider: t ^ uj < tA'(F') and t > tA'(F') = 0. Set 

T(F,A) := min jn, (logn) A ( F - A )}, 
and recall that, by Lemmas A. 3. 2 and A. 3. 3, we have 

fF'A*) < ^(F,A) £ ■ (\ogn) A ^ A ^ 3 V A ( F > A ) and A> A ,(F')(n 3/2 ) ^ ■ N A (F)(n 3 ' 2 ). 

v ^ 

Together with the event S(m), this implies that 

AV(F')(m) < iV^(F')(m) + (t) • N A '(F')(n 3 / 2 ) 

(losn) AiF ' A) ^ A{F ' A) ~ 
Vn 

In the latter case, Lemma A. 3. 4 gives us 

(\osn) A(F ' A) - 2 V A ( F ' A ) _ 
v 71 



Since IJ-J^I ^ v a(F) 2 ^ (logn) 2 / 5 , the upper bound in (26) follows. The proof of the bounds 
on D<p(F)(m) is identical, using [2, Lemma 4.30]. □ 

We can now apply Lemma 3.2 to the variables C^{F) and D^F). 

Proof of Proposition 4-55. For each m G [m*], set K(m) = £(m)<lM(m)<l)C £ (m). We shall 
bound, for each m ^ to ■ n 3 ^ 2 , the probability that m is the minimal m E N such that 
K{m— 1) holds, and 

N^F){m) N A (F)(m) ± f FjA (t) ■ N A (F)(n 3/2 ) 

for some which is faithful at time t — m • n~ 3 l 2 . Note that the event in the statement of 
the proposition implies that this event holds for some m ^ to ■ n 3 ^ 2 . 
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Fix m ^ ijj ■ n 3 / 2 , and for each m! ^ m , define random variables 



M c( m> ) = E 

m=0 

and 

m'—l 



r rirum] e(F) ■ N A (F)(m) f FA (t) ■ N A (F)(n^) 
C,{F){m) — m ± 



M±(m') = £ 



m=0 



D^){m) -3^ ± ^3/2 J F,A\t)N A\F ){p> ) 



It follows from Lemmas 4.58 and 4.59 that, while the event £(m) fl 3^(^) H Z(m) fl Q{m) 
holds, and are both super-/sub-martingale pairs. Now, set 

where T(F, A) = min {n, (logn) A ( F ' A )}, and 



P = 

By Lemma 4.60, we have 



/ (lo gn) ^> + /^v 



-/3 ^ AMp(m) + AM^(m) ^ a 

while £(m) n .M(m) holds. Moreover, since /f,a(*o) ^ n- 1 / 4 (logn) A(F ' A) ~v /A ( F ' A ) ) and we 
may assume that mo ^ n £ , we have 

a- (3 -m /f.a^o) 2 



iV A (F)(n 3 / 2 ) 2 (logn) 4 
Hence, by Lemma 3.2, we obtain 



P^Mo(mo) > ^(*o)^(F)(ri 3/2 )) n /C(m - 1)) < e^") 3 , 

and similarly for M^t, and Mj. 
To complete the proof, note that 

m '- 1 „3/2 

E + W ^) + !) • /^(*) < ~c • 

m=0 

Therefore, if 

max{min{|M+(m)|,|M^(m)|},min{|M+(m)|,|M^(m)|}} < ^/ F , A (t)iV A (F)(n 3/2 ) 
then 

E (<^("0 - ^) W) e E " 8t • ^) ± /^(O^(i0(n 3/2 ). 

m=0 m=0 ^ ' 
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Finally, note that ^ N A (F)(m) = (^p- -8t-o(F))N A (F)(m), and that the number of choices 
for 4> is negligible, since \A\ ^ (logn) 1 / 5 . Hence, with probability at least 1 - n- 31ogn , we 
have 



m'-l 



N 4>( F )( m ') = E (^(^(m)-^)^)) G N A {F)(m') ± /^(t')iV A (F)(n 3 / 2 ), 



m=0 



as required. □ 

4.2. The number of open edges in a set before time t — u. Let us finish this section 
by using Bohman's method to prove [2, Lemma 7.12]. The proof is almost identical to that 
of the bounds on X e (m) in Proposition 4.56. 

Lemma 7.12. Let S C V{G m ), and let M = (A±, . . . , A k ) be a collection of subsets of S. If 
\S\y/n ^ oj^(S,0) ^ n 5 / 4 , then 

p({|o^(s,m)| > 1} ntf(s,m) ny(m) n Q(m)) < n- |s|n45 

/or every m ^ to ■ n 3//2 . 

In fact we shall prove the following, slightly stronger statement. Set 

foit) = n 3S -f y (t), 

and note that we have f (t) <C g Q (t)e~ At2 for every m ^ u ■ n 3//2 , and so the following lemma 
trivially implies Lemma 7.12. 

Lemma A. 4.1. Let S C V{G m ), and let Af = (A ± , . . . , A k ) be a collection of subsets of S. 
Suppose that \S\y/n ^ oj\f(S, 0) ^ n 5 ^ 4 , and let m ^ uj • r?l 2 . Then, with probability at least 
1 - n -\s\n 4S either (tf(S,m)ny(m) n Q{m)) c holds, or 

o M (S,m) e e- 4t2 o M (S,0) ± f o (t)o M (S,0). (27) 

Proof. Let m ^ to ■ n 3//2 be minimal such that the event Af(S, m ) D 3^ (mo) H <2(m ) holds, 
and 

o x (S,m ) £ e~<o M (S,Q) ± f o (t )o„(S,0), 
where t = m • n~ 3//2 . It follows that, if m < m , then 

E[Ao M (S,m)} = E r /( m ) 

Vl ^ /eOjv(S,m) 



^M(l±e 4 * 2 / W) 2 (^M±/,(^(' 
Q(m) v 



ft _4i2 . / t + 1 



C -e- 0^(5,0) ± VC I ] -/oWo^O), 
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since Y(m) = Q(t ■ e 4 * 2 ■ F(n 3 / 2 )), and using the event y(m) fl Q(m) and the bounds (27), 
which hold for all m < m . It follows that 



m'—l 



m=0 



8t 



Ao M (S, m) + • e' 4 ' o^S, 0) ± VC ■ 



n 



t + 1 

3/2 



f o (t)o M (S,0) 



is a super- /sub- martingale pair on ^ m! < mo- Moreover, we have 

-n 5 < AoAr(.S,m) ^ 
for every m < m , since Jx(S,m) holds (see [2, Section 7.4]), and so 

-2-n s <: AM£ M (m) < 5 ' °) 



3/2 



f / Q" >> /o(t)oAT(>g,0) ^ f o (t)Q M (S,0) 2 65 

for every m < m . Since m ^ — ^ — 7 > — — — ^ — and f (t) z o^f(S, 0) > no, 

it follows by Lemma 3.2 that 



P( M- e (m ) > -f o (t)o x (S,0) ) < exp 



f o (tfo M (S,0)\ ^ n ^ ni s 



C 2 -n 5 



and similarly for M-J . It follows that, with probability at least 1 — n I 5 '" 4 , we have 



m'-l 



o N (S,m') eo M (S,0)- 



m=0 



8* 

372 



n 



o^r(S,m) ± vC 



t+1 

n 3 / 2 



f o (t)o M (S,0) 



C e- 4t ' 2 o A r(S:,0)± f (t')o^(S, 0) 



as required. 



±2/.(^0) 



□ 



5. Section 6: Whirlpools 

In this section, we shall prove that the variables X, Y and Q follow (in expected value) a 
three-dimensional dynamical system which looks like a whirlpool. Set 

r(m) = * (w) -* (ro) , F*(m) = F(m) - fim) and Q'(m) = Q(m) " ^ 



9q(t)Q(m) 



g q (t)X(m) g q (t)Y(m) 
for each m G N. We shall prove the following lemma. 

Lemma 6.2. Let w • n 3 / 2 < m < m* 7 and suppose that X(m) D iV(m) fl Q(m) /io/ds. Then 
(o) E[AQ*(m)] G 4^(-2F*(m) + g*(m)±£^ 

(b) E[AY*(m)] G (lie) • ( - 3F*(m) + 2Q*(m) i £ ) . 

(c) E[AX*(m)] G (lie)-^^-X*(m)-4F*(m)+4g*(m)ie). 



We begin by calculating the expected step-change in the variables X(m), F(m) and Q{m). 
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Lemma 6.3. For every m G N, 

E[AQ(m)] = -Y{m) - 1. 
Proof. This is trivial, since if edge e is chosen in step m + 1, then AQ(m) = —Y e (m) — 1. □ 
Lemma 6.4. Let u; • n 3 / 2 < m < m* . // <Y(m) fl iV(m) fl Q(m) holds, then 

E[AF(m)] G —^—(-Y(m) 2 + X(m) -2- Var(F e (m)) ±0(r(m))Y 

Lemma 6.5. Let w • n 3 / 2 < m < m*. If X(m) fl iV(m) fl Q(m) /ioWs, £/ien 

E[Al(m)] G ^^^-2-X(m)F(m)-3-Cov(X,F)±0(^X(m) + F(m) 2 ^. 

Recall first that the F-graph has vertex set 0(G m ), and an edge between each pair {/, /'} 
such that /' G Yf(m). We begin with a simple observation about this graph. 

Observation A. 5.1. The Y -graph is triangle- free, i.e., for every m G N, there do not exist 
three distinct open edges e, f and g of G rn with e G Yf(m), f G Y g (m) and g G Y e (m). 

Proof. This follows from the fact that G m is triangle-free. Indeed, let {e, /, g} C 0(G m ) 
be open edges which form a triangle in the F-graph, and suppose first that they have a 
common endpoint. Then the other endpoints of e, / and g form a triangle in G m , which is 
a contradiction. On the other hand, if e, / and g do not share a common endpoint, then 
they must form a triangle (since they are pairwise intersecting), which contradicts (e.g.) the 
assumption that e G Yf(m). It follows that no such triple exists, as claimed. □ 

In order to prove Lemmas 6.4 and 6.5, we shall use the variables 

Y(m) = Y ^ m ) and X ( m ) = X e(m), 

eeQ(m) eeQ(m) 

which are exactly twice the number of edges in the F-graph, and six times the number of 
open triangles in G m , respectively. 

We first bound the expected change in Y(m). 

Lemma A. 5. 2. 

E[AY(ra)] = ^(x(m) - 2 £ Y e (m)A. 

^ ' ^ eeQ(m) ^ 

Proof. Suppose we add edge e G 0(G m ) in step m + 1. We claim that 

AY(m) = X e (m) - 2 17(m). (28) 

feY e (m) 

To see this, observe that in step m+1 we remove from the F-graph the vertices corresponding 
to each / G Y e (m), and add a matching between the vertices corresponding to X e (m). Since 
the y-graph is triangle-free, by Observation A. 5.1, and each / G Y e (m) is (by definition) 
a F-neighbour of e, it follows that the number of edges of the F-graph which are removed 
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TO 



is exactly J2feY e ( m ) ^7( m )- Since Y(to) is equal to twice the number of edges in the Y- 
graph, (28) follows. 

Summing over edges e G Q(m), it follows that 

E[Y(TO + l)-Y(TO)|G m ] ~£ [x e (m)-2 £ Y f 

^ ' eeQ(m) feY e {m) 

Now recall that / G Y e (m) if and only if e G Yf(m), and thus 

eeQ(m) feY e (m) feQ(m) 

(Indeed, this is simply the number of walks of length two in the F-graph.) Hence 



E[AY(to)] = 



X(to) 2 
Q(m) Q(m) 



eeQ(m) 

as required. 

Recall the following simple lemma, which was stated (but not proved) in [2, Section 5]. 
Lemma 5.28. 



□ 



E 



Proof. We have 



A(m) 
B(m) 



E[AA(m 



1 



B(m) 5(to) 



E 



A(m + 1)A5(to) 
B(m + 1) 



Gv, 



E 



A(to + 1) A(to) 
B(m + lj ~ 5 (to) 



Gr, 



E 



5(m),4(777 + 1) - 5(to + l)A(m) 
B{m)(B{m + 1) 



1 



fifTO) 



-E 



5(777 + 1) (A(to + 1) - A{m)) + (5(to) - 5(to + l))A(m + 1) 



5(to+1) 



E[AA(m)] 1 
5 (to) 5(m) 



(5(TO) - 5(777 + 1)) A(m + 1) 
5(777 + 1) 



GV. 



as claimed. 



□ 



Recall that Var(F e (777)) = E[Y e (777) 2 ] — Y(m) 2 , where the expectation is over the choice 
of the edge e G 0(G m ), and that the event X{m) D y(m) implies that 



X e (777) G (2 ± o(l))e- 8 ' 2 77 and Y e (m) G (4 ± o{l))te 



it- 



n 



(29) 



for every e G 0(G m ). We can now prove Lemma 6.4. 
Proof of Lemma 6.4- By Lemma 5.28, we have 



E[A7(m)] 



E[AY(tt7 



1 



(5(777) (5(777 



-E 



Y(tt7 + l)AQ(m) 
Q(m + l) 



(30) 
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Now, we claim that 



Q(m + 1) \ Y(m) 2 JQ(m) 
To see this, note that by (28) and (29), since X(m) fl y(m) n Q(m) holds we have 
|AY(m)| < 3 .Y(m) + Y(mf ^ J_ ^ |AQ(m)| < 2 <<; ^ 1 



¥(m + 1) e fl±^Ml^cFM±l. 



Y(m) F(m)-Q(m) ' V(///)- Q(m) ' Q(m) ' Y(m) 2 

Since E[AQ(m)] = -F(m) - 1, and using (30), it follows that 

r . E[AY(m)l F(m) 2 ±0(F(m)) 

E [AY m G L \ ;J + 1 ^ . 

Q(m) Q{m) 

By Lemma A. 5. 2, and since E[Y e (m) 2 ] = Var(F e (m)) + Y(m) 2 , this implies that 

*™ ^(M- + F(m)2 ^f (m)) 

= ^-y^X(m) -F(m) 2 -2- Var(r e (m)) ±0(F(m))), 

as required. □ 

The proof for X{m) is similar. Recall that Cov(X, Y) = E[X e • Y e ] — X ■ Y, where the 
expectation is over the (uniformly random) choice of the edge e G 0{G m ). 

Lemma A. 5.3. If X{m) ny(m) holds, then 

E[AX(m)] G £ X/H-y/H ±0(x(m) + r(m) 2 ). 

Proof. Observe first that X(m) is simply the number of labelled open triangles in G m (i.e., 
six times the number of unlabelled open triangles). We claim that if edge e is chosen in step 
m + 1 of the triangle-free process, then 

-3-X e (m) <: AX(m) + 3 X /( m ) < 6 • F e (m) 2 (31) 

To see the lower bound, note that no new open triangles are created, and each edge / G 
Y" e (m)U{e} which is closed in step m+1 destroys at most Xf(m)/2 unlabelled open triangles. 
On the other hand, an open triangle is destroyed in two different ways only if it has two 
edges in Y e (m) U {e}, and hence we have double-counted at most Y e {m) 2 unlabelled open 
triangles, which gives the upper bound in (31). 

Now, summing over edges e G Q(m), it follows that 

E[AX(m)] G J2 E ± 0{x(m) + Y(mf). 
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Since / G Y e (m) if and only if e G Yf(m), it follows that 

E E X /( m ) = E -ty™). 

eeQ(m) feY e (m) /GQ(m) 

and hence 

E[AX(m)] G E A/(m) • ^(m) ± O (X(m) + F(m) ; 

as required. 

We can now prove Lemma 6.5. 
Proof of Lemma 6.5. By Lemma 5.28 we have 



and we have 



E[AX(m)] 



X(m + 1) 
Q(m + 1) 



E[AX(m)] 

g(m) 



G 



Q(m) 



E 



Xfm) 



F(m) 2 / Q(m 



X(m + l)AQ(m) 
Q(m + 1) 



C X(m) ± 1. 



To see this, note that by (29) and (31), since X(m) fl y(m) fl Q(m) holds we have 



|AX(m)| 4 X(m)-y(m) ^ 1 



and ^gM^ 2 .lM« 1 



Q(m) 



Q{m) ' Y(m) 2 



X(m) X(m) ■ Q(m) " F(m) 2 

Since E[AQ(m)] = -F(m) - 1, it follows that 

r _ / xl E[AX(m)l X(m)Y(m)±0(X(m)+Y(m)) 

By Lemma A. 5. 3, and since E[X e • Y" e ] = X ■ Y + Cov(X, V), this implies that 



E 



[AX(m)] G ( - 2 • X(m)F(m) - 3 • Cov(X, Y)±0 (x{m) + Y{m) 



as required. 

We can now deduce Lemma 6.2 via a rather tedious calculation. 



□ 



□ 



Proof of Lemma 6.2. Let u ■ n 3 / 2 < m < m*, and suppose that X(m) fl iV(m) fl Q(m) holds. 
By Lemma 5.28 we have 



= E[AQ(m) - AQ(m)] _ ^Mfa)),, 



g*(m + i) la 



g q (t)Q(m) g q (t)Q(m) 
Recall that E[AQ(m)] = -F(m) - 1 and AQ(m) G -F(m) ± 1, and note that 



A(^(t)g(m)) G 



4* 



3/2 



&(t)Q(m) ± 1, 
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since g q (t)Q(m) is equal to e~ 2 * 2 times some function of n. It follows that 



E[AQ*(m)] e - nH^ + J_. Q . (m)± J_ c iL(-2FV) + Q'(m) ±E ), 



since the event Q(m) implies that |Q*(m)| ^ 1, and hence 

F*(m)-f(m) { 4t ^ 1 _ 4t 

Q(m) 
as required. 

We turn next to Y (to). Observe that, by Lemma 5.28, we have 

r xl E\AY(m) - AY(m)] A(gJt)Y(m)) r_. xl i 

E Ay to = -t K —L = yyqK 4 v v • E y to + 1 )\G m , 32 

L 1 ;J ^(t)y(m) ^(t)y(m) L J| m J' 1 ; 

and recall that, by Lemma 6.4, if A'(m) D y(m) D Q(m) holds then 

E[AF(to)] G ( - F(m) 2 + X(m) -2- Var(y(m)) ±0(Y(m))). (33) 

Now, since F(to) = 4te _4 * 2 v / n, a simple calculation gives 

a,>, x /8t 2 - 1\ ~ , , 1 -y(m) 2 + X(m) 1 . 

AY m G - ^ y (to) ± — C M ^ ± — , 34 

V ; \t-nV 2 J K ' n 2 " Q{ m ) n 2 ' V ; 

and similarly, since g q (t)Y(m) = 4te~ 2 ' 2 n 1 / 4 (logn) 3 and t ^ uj, we obtain 

A MW)e-(^)^N±^ C -(l±o(l)).^-^)y(m). (35) 

Moreover, since y(m) holds, we have 

Var(y e (m)) < g y (t) 2 Y(m) 2 < g q (t)Y(m) 2 . (36) 

Recalling that F(to) = (l+^(t)F*(m))y(m) and Q(to) = (l+0,(*)Q*(m))Q(m), it follows 
from (33), (34) and (36) that 

E[AF(to)] - AF(to) G -^^((1 + 9 q (t)Y*(m)) 2 - (l + g q (t)<?(m)j) 

+ g((, +ft( rw)-(.H^w)) ^ 1) g w . 

Observing that X(m) <C F(m) 2 , which holds since t ^ u>, we deduce that 

E[AF(to)] -AF(m) G gi^^L ( - 2 F*(m) + Q*(m) ± o(l)) (37) 
if \X*{m)\ + |y*(m)| + \Q*(m)\ = 0(1), which follows from the event Q(m). 
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Now, combining (32), (35) and (37), we obtain 

E[AF*(to)] e ^|^(-2F*(m) + Q*(m)±o(l)) + (l±o(l))^(F*(m)+E[AF*(m)]), 
which easily implies that 

E[AF*(to)] G (lie) • ( - 3F*(m) + 2Q*(m) ± , 

as required. 

Finally, we turn to X (to). By Lemma 5.28, we have 

iTATV M E[AX(to)-AX(to)] Afa(t)X(m)) r * . i 

EAX(to) = • E X (to + 1) G m , (38) 

g q (t)X(m) g q (t)X(m) L J 

and by Lemma 6.5, if A'(m) D iV(m) D Q(to) holds then 

E[Al(m)] G ^^^-2-X(m)F(m)-3-Cov(X,F)io(x(m) + r(m) 2 ^. (39) 

Now, since X(to) = 2e _8 * 2 n, a simple calculation gives 

. ~ / x 16t ~, N , 1 2-X(to)-F(to) , 1 

Ai m G ^ • X to) i — C K-^ ^ i — , 40 

V ' n 3 / 2 V ; n 2 ~ Q( m ) n 2 ' v ; 

and similarly, since g q (t)X(m) = 2e~ 6 * 2 n 3 / 4 (logn) 3 and t ^ w, we obtain 

12/ _ 1 12t 

A(g q (t)X(m))e--- 2 -g q (t)X(m)±- C - (l i o(l)) ■ — • ^(t)X(m). (41) 

II/' lb I V ' 

Moreover, since X(m) r\y(m) holds, we have 

Cov(X e {m),Y e {m)) < g x (t)X(m)-g y (t)Y(m) « g q (t)X(m)Y(m), (42) 

and note also that X(to) + Y(m) 2 <C g q (t)X(m)Y(m) for every to ^ to*. 

Combining (39), (40) and (42), it follows that E[AX(to)] - AX (to) is contained in 

2 ' ^QM ( " (1 + 9qit)T (m)) (1 + (m)) + (1 + ^ (t)g * (m) ) ± ' ^ (t) ) ' 

and hence 

2 • g q (tyX(m)Y (m 
Q(m) 

Finally, it follows from (38), (41) and (43) that 



e[ax(to)1 -ax(to) g 2 • fl g (t)x(m)y(TO) / _ T _ T + g ± \ (43) 



E[AX*(to)] G 2 ( - X*(to) ~ Y*(rn) + Q*{m) i o(l)) 

+ (lio(l))-^-(r(m)+E[Ar(TO)]), 
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which easily implies that 



E[AX*(m)] E (l±e).J^(-X*(m)-4y*(m)+4Q*(m)±e), 



At 

n 3 

as required. □ 
Finally, let's prove Lemma 6.7. Recall that 

Y* \ ( 4 5 \ / A 



Q* J \4 3 J \ fi 
and A(m) = A(m) 2 + fi(m) 2 . 

Lemma 6.7. Let ui ■ n 3 / 2 < m < m* 7 and suppose that X(m) fl iV(m) fl Q(m) /io/ds. TTien 

(logn) 3 



|AX (m)| + | AY (m)\ + |AQ*(m)| < 



n 



3/2 : 



and /ience 

We shall use the following bound, which follows easily from Lemma 4.23. 

Lemma A. 5. 4. Let A e {X,Y,Q} and let A be the corresponding member of {X ,Y ,Q} . 
For every cu ■ n 3 / 2 < m ^ m* , if Q(m) holds, then 



, a a*, m ^ 2 /|AA(m)| logn 



A(m) 



n 



3/2 



Proof. Since A(m) is equal to either te kf2 or e fc * 2 times some function of n, where k G {4, 8}, 
g q (t)A(m) is equal to e -( fc - 2 )* times some function of n, and t > u, we have 

~. . -2fct±o(l) ~. , , a / / \ n / \\ -(2k-A)t±o(l) , N -r, x 
AA ( m ) G n3/2 • and A(0,(t)A(m)) e — ^ • &(f)A(m), 

and hence 

logn ~ / s j , A / w ?/ Ul logn 

n; 



\AA{m)\ < -A^- . A(m) and | A A(m)) | < -A_ . g q (t)A{m). 



Moreover, the event Q(m) implies that A(m) ^ (l + g q (t))A(m), and g q (t) <C 1. Thus, 
applying Lemma 4.23, we obtain 

, a ,*/ \ i 2 (\AA{m)\ logn 



A{m) n 3/2 

as claimed. □ 



Proof of Lemma 6.7. By Lemma A. 5. 4, we have 

AX(m)| |AF(m)| \AQ(m)\ k 
&(f) V X(m) y(m) Q(m) n 3/2 



a^7*/ \ i , A T7*/ \ i , . m „ 6 /|AX(m)| |Ay(m)| |AQ(m)| logn 
AX (m)| + |Ay (m)| + |AQ*(m)| ^ — - 1 - / /' + 1 _ / /' + 1 _/ /' + 
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so it will suffice to prove that 

(\AX(m)\ \AY(m)\ \AQ(m)\) \ogn 

max < — = , — = , — = > ^ 

I X(m) Y(m) Q{m) J 



n 



3/2 



For Q(m), the bound is trivial, since if y(m) holds then AQ{m) G —(1 ±e)Y(m). To prove 
the bound for Y(to), recall first (see (28)) that 

AY(to) = X e (m) - 2 Y f( m )i 

feY e (m) 

and observe that therefore, since X{m) fl y(m) holds, 

AY(m) G (l±e)(x(m) -2-Y(m) 2 ). 
Since X{m) Y(m) 2 for t ^ w, and AQ{m) G — (1 ± e)Y(m), it follows that 

Q(m + 1) G V Q(m)J\ Q{m)-Y{m) 

and hence 

|AY(to)| < (l±t^)( Y ^ + ^Z 2 -^ 2 )-Y im) < 



Q(m)7V Q(to)-Y(to) / Q(m) ' 

as required. 

Finally, to prove the bound for X(m), recall from (31) that 

-3-X e (m) ^ AX(m) + 3 X f( m ) < 6 • Y e (m) 2 

/eYe(m) 

and observe that therefore, since X(m) fl y(m) holds, 

AX(to) G -(3±e) -X(to)Y(to). 

As before, it follows that 

X(m + 1) - X(m + 1 ) e f 1± Y(m)\ fX(m) -3- X(m)Y(m) 



Q(m + 1) V Q(m)J\ Q{m)-Y{m 
and hence 

A3f(m) « f lt MpW-»W'-W). ?M 4 4.x W y W 



<5(to) / V <5(to) — Y(to) / Q(m) 

as required. As noted above, it follows from Lemma A. 5. 4 and our bounds on |AQ(m)|, 
\AY(m)\ and \AX(m)\ that 

(logn) 3 



\AX (to)| + | AY (to) | + |AQ*(to)| < 



g q (t).nW 
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Finally, in order to deduce the claimed bounds on |AA(m)| and E[|AA(m)|] , simply note 
that 

|AA(m)| < 2(jA(m) • AA(m)| + \fx(m) • A^(m)|) + |AA(m)| 2 + \A/i(m)\ 2 , 

and that 

|AA(m)| + |A/x(m)| =<3(|AF*(m)| + |AQ*(m)|) and |A(m)| + \n(m)\ = O(l), 
since the event Q(m) holds. It follows immediately that 

and therefore that the same bound holds for E[]AA(m)|] , as required. □ 

6. The martingale inequalities 

In this section, for completeness, we shall give the proof our main martingale lemma [2, 
Lemma 3.1]. The proof below is taken from the survey of McDiarmid [3]. Recall that we 
assume throughout that for each martingale M we consider, M{m) depends only on the 
graph G m+r for each m G [0, s], for some (fixed) r G N depending on M. 

Lemma 3.1 (Theorem 3.15 of [3]). Let M be a super-martingale, defined on [0, s], such that 

\AM(m)\ ^ a and E[|AM(m)|] ^ /3 (44) 
for every m G [0, s — 1]. Then, for every ^ a ^ (3s, 



P(M(s) > M(0) + a) ^ exp 
We begin with a straightforward observation. 



a 2 



Aa/3s 



Lemma A. 6.1 (Lemma 2.8 of [3]). Let X be a random variable with E[X] ^ and X ^ b, 
and for each x G R \ {0} 7 set 

e x — x — 1 



x 2 



The function g is increasing onl\ {0} 7 and 

E[e x ] ^ exp(^(6)E[X 2 ]). (45) 

Proof. The fact that g is increasing follows from simple calculus. Indeed, for each x G IR\{0}, 

= (*-2)e- + s + 2 „ 

X 6 

since h(x) — (x — 2)e x + x + 2 satisfies /i(0) = and — (x — l)e x + 1 > for every 

To see the latter bound, simply note that h'(0) = and h"{x) = xe x . 
To deduce (45), simply note that 

e x = 1 + x + x 2 g(x) < 1 + x + rc 2 ^) 
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for every x ^ b (setting g(0) = 0). Since E[X] ^ and X ^ b, it follows immediately that 

E[e x ] <: l+g(b)E[X 2 } <: exp (g(b)E[X 2 f) , 
as required. □ 
We next prove another relatively straightforward preliminary lemma. 

Lemma A. 6. 2 (Lemma 3.16 of [3]). Let M be a super-martingale defined on [0,s], and let 
r G N. Suppose that M(m) depends only on G m+r for each m e [0, s]. Then, for any fteR, 

s-l 



E 



3 /i(Af(s)-M(0)) 



G, 



sup 



n*[. 

m=0 



m+r 



G, 



(46) 



Proof. The proof is by induction on s. When s = the result is trivial, since (46) reduces 
to 1 < 1. So let s > 1, and suppose the result holds for s - 1. Set A = e h (M( s yM(i)) and 

s-l 
m=l 

By the induction hypothesis, we have E [A | G r +i] < E[£?|G r+ i], and note that trivially 
sup (B I G r ) ^ sup (B I G r+ ij. It follows that 



E 



D h(M(s)-M(0)) 



Gi 



E 



e hAM ^E[A\G r+1 ] 



< E 



e hAM(0) sup ( B | Gr+1 ) 



G r 
^ E 



= sup (B | G^Efe^W | G r ] = sup ( JJlJe^^ | G m+r ] | G r ) 



□ 



as required. 

We shall use one more trivial observation, which is an immediate consequence of [3, 
Lemma 2.4]. (Here log denotes the natural logarithm.) 



Observation A. 6. 3. For every ^ x ^ 1, 



(1 + x) log(l + x) - x > 



ar 



We are now ready to prove Lemma 3.1. 

Proof of Lemma 3.1. Note that E[AM(m)] 0, since M is a super-martingale, and recall 
that \AM(m)\ ^ a. Applying Lemma A. 6.1 to the random variable X = h ■ AM(m), it 
follows that 



E 



e hAM ^\G m+r ] ^ e W (h 2 g(ha)E[(AM(m)) 2 ]y 



Next, note that 

E[(AM(m)) 2 ] ^ af3, 
since \AM(m)\ ^ a and E[|AM(m)|] < (3. 



(47) 
(48) 
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Now, combining Lemma A. 6. 2 with (47) and (48), we obtain 



E 



h(M{s)-M(ti)) 



G r 



sup 



s-1 



m=0 

s-1 



e AAM(m) I q 



m+r 



sup I J\ E^exp (h 2 g(ha)E[(AM(m)) 2 ] 



m=0 



G 



m+r 



^ exp (h 2 g(ha)a(3s^. 
Thus, for any h > 0, by Markov's inequality, 

P(M(s) > M(0) + a) = p(>(M( s )-M(o)) ^ e fea ) 



< e- ha E 



D h(M(s)-M(0)) 



Setting h = ^ log (^f^), and noting that 



-/ia + h 2 g(ha)a(3s = log 

a 



/3s 



a + (3s 



G r ^ exp ^ — ha + h 2 g(ha)a(3s^ . 
(3sfa + (3s fa + (3s s 



+ 



l + ^llog(l + 



as 
a 



+ 



log 

a 



(is) f3s 

Finally, applying Observation A. 6. 3 with x = a/ (3s, it follows that 

1 / a 



P(M(s) > M (0) + a) ^ exp ^ - 



- 



a 4 \/?s 



exp - 



4a(3s 



since a ^ (3s, as required. 
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